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Introduction to the Gopakumar-Vafa Large N Duality 



Gopakumar-Vafa Large N Duality is a correspondence between Chern-Simons 
invariants of a link in a 3-manifold and relative Gromov-Witten invariants of a 
6-dimensional symplectic manifold relative to a Lagrangian submanifold. We 
address the correspondence between the Chern-Simons free energy of S 3 with 
no link and the Gromov-Witten invariant of the resolved conifold in great detail. 
This case avoids mathematical difficulties in formulating a definition of relative 
Gromov-Witten invariants, but includes all of the important ideas. 

There is a vast amount of background material related to this duality. We make 
a point of collecting all of the background material required to check this duality 
in the case of the 3-sphere, and we have tried to present the material in a way 
complementary to the existing literature. This paper contains a large section on 
Gromov-Witten theory and a large section on quantum invariants of 3-manifolds. 
It also includes some physical motivation, but for the most part it avoids physical 
terminology. 
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Introduction 

Large N duality is a conjectural correspondence between two very different types of 
mathematical objects: the large N limit of a gauge theory with structure group U(N) and 
a string theory. Since gauge theories and string theories are both meant to describe the 
same universe it is natural to expect a correspondence between their predictions. There 
are several examples in physics and mathematics literature of this apparent duality. The 
original description of this correspondence goes back to the 1974 theoretical physics 
paper [76] by 't Hooft. At the time the main computational tool in both theories was 
perturbative expansion and 't Hooft noticed intriguing similarities that occur in those 
expansions. One of the first mathematical papers related to a Large N Duality is a 1992 
paper where Kontsevich introduced a matrix model to resolve the conjectures of Witten 
about 2-dimensional gravity [88]. 

In oversimplified terms, gauge theory studies moduli spaces of connections on principal 
bundles while string theory studies spaces of maps from a particular class of domains 
into different targets. Both theories lead to invariants (of the base manifold in the gauge 
case and the target manifold in the string case). These invariants can be conveniently 
assembled into generating functions (called 'partition functions' by physicists), 't Hooft 
was considering expansions for gauge theories with S\J(N) structure groups and noticed 
that as N — > oo they turn into partition functions one expects from a string theory. 

In this generality 't Hooft's principle remains beyond the reach of mathematical theory 
for the foreseeable future. Ten years after Kontsevich's paper people began to understand 
the Large N Duality relating the Chern-Simons SU(A0 gauge theory of 3 -manifolds to 
the Gromov-Witten string theory of complex 3 -folds. 

The aspect of Large N Duality that we address in this survey is a duality between Chern- 
Simons theory and Gromov-Witten theory, the Gopakumar-Vafa Large N Duality 
duality. This duality states that the Chern-Simons (Reshetikhin-Turaev) invariants of 
a link in a 3 -manifold are related to the relative Gromov-Witten invariants of a 6- 
dimensional symplectic manifold relative to a 3 -dimensional Lagrangian submanifold. 
We address the correspondence between the Chern-Simons free energy of S with no 
link and the Gromov-Witten invariant of the resolved conifold in great detail. 

The key trait of both theories is that they are 'topological' in the sense that they do not 
depend on a background metric on the manifold in question. This greatly simplifies the 
setting and makes it possible to explain the Large /V Duality in mathematical terms, 
something that remains impossible for other examples. 
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Mathematically the most that can be done is to compute the invariant on the Gromov- 
Witten side and compute the invariant on the Chern-Simons side and compare the two 
answers. This is what we do. The first part of this paper (Part I) is an exposition of 
Gromov-Witten theory up to the point of the computation of the Gromov-Witten free 
energy of the resolved conifold (the full multiple cover formula). The definition of 
Gromov-Witten invariants is given in Section 3 with intuitive descriptions of some of 
the more technical elements. Formal definitions are given in the following subsections 
as they are motivated by and required for ever more complicated sample computations. 
We have included all of the relevant definitions. For example, the definition of a stack is 
given in Appendix A. Most expositions on Gromov-Witten theory avoid this definition. 

One of the unique things that we do in this paper is a computation of the genus 
zero, degree two Gromov-Witten invariant of the resolved conifold directly from the 
definition; see Section 10. This case is addressed via localization in the book [43] by 
Cox and Katz. 

The second part of this paper (Part II) is an exposition of Chern-Simons theory up to 
the point of the computation of the Chern-Simons free energy of the 3 -sphere. The 
Chern-Simons invariants were motivated by a path integral expression. We outline 
the progression from this heuristic definition to formal definitions of perturbative 
invariants in Sections 14 and 15 and the first two parts of Section 16. The motivation 
for introducing the free energy is explained in Section 14. 

The skein theory approach is described in Appendix C; this is the easiest way to describe 
the Chern-Simons invariants. It is however very difficult to compute from the resulting 
expressions, so we rely on the quantum group approach instead. 

The main subsection in the second part is Section 16. It is here that we motivate and 
give a formal definition of the quantum group invariants. The second part of the paper 
ends with the computation of the Chern-Simons partition function. 

Part III begins with Section 19 where we use special function techniques to derive the 
formal relation between the Gromov-Witten free energy and the Chern-Simons free 
energy. The remainder of this part is overview and history. 

To get a fast introduction to the Gopakumar-Vafa Large N Duality one may just read 
the overview or read the first definition of the Gromov-Witten invariants from Section 3, 
the skein theory definition of the Chern-Simons invariants from Appendix C and the 
comparison of the two in Section 19. Some physical intuition may be obtained from the 
description of the perturbative expansion in Section 15. 

Chern-Simons theory is defined for real 3-manifolds while the relevant Gromov-Witten 
theory is defined for Calabi-Yau complex 3-manifolds (in general one can define 
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Gromov-Witten invariants for arbitrary symplectic manifolds, see Part I). Thus the 
mathematically oriented reader can see Large N Duality as an interesting correspondence 
between 3-dimensional real and complex geometries and topologies. Physically the 
importance of the Calabi-Yau condition is that in string theory Calabi-Yau 3-folds (ie 
6-real dimensional manifolds) provide complementary 'compactified' dimensions to 
the 4 observed ones of the classical space-time. 

The existing literature on Large N Duality is vast and is growing exponentially so it 
would be impossible to survey it here. However, it appears to fall mostly into two 
categories: one written by physicists with extensive use of physical terminology (see the 
survey by Marino [104] and references therein), another by or for algebraic geometers 
(see Cox and Katz [43] and the second half of Hori et al [77]). This reflects the fact 
that the physical insight and the complex side of the duality are at present the most 
developed parts. 

One of the difficulties that impedes further progress in this subject is the amount 
of background material required to comprehend all the mathematical aspects of the 
conjectural duality (as one can judge, for example, from the size of the background 
chapters in Hori et al [77] and Turaev [152]). 

We tried to provide a relatively self-contained introduction to the existing ideas and 
methods involved in Large ,/V Duality that was complementary to the existing literature. 
We fill in details where we had trouble finding them and leave well-documented 
computations as exercises. 

In order to keep the size of the the paper manageable we stuck to topics that could be 
formalized mathematically. In particular we cover the duality between the 3 -sphere 
and the resolved conifold without including any knots. We also provide a number of 
computational and illustrative examples to make the matters clearer to a non-specialist. 
It is hoped that this paper will be accessible to advanced graduate students and will 
help to bring new blood into the field. Exercises are spread all over the text along 
with references to their solutions (or ideas for such). Although formally not necessary 
to get through the paper they are important for those who plan to acquire a working 
understanding of the subject matter. 

The idea of writing this paper dates back to the workshop Interaction of Finite-type and 
Gromov-Witten Invariants at the Banff International Research Station in November 
2003 co-organized by the first author and Jim Bryan. M Marino gave a mini-series 
of lectures at this workshop on physics and mathematics of the Gopakumar-Vafa 
conjecture (as the Large N Duality was dubbed at the time). The plan was to write up 
lecture notes accessible to mathematics graduate students. However, M Marino wrote 
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an introduction that followed his lectures fairly closely [104] and it became clear that 
more details could not be given without writing a fair amount of background material. 
The final version of the paper emerged out of friendly discussions between the two 
authors as we learned this material. 

We would like to thank the directors and staff of the Banff International Research Station 
for providing an amazing place to share mathematics. We would like to thank Marcos 
Marino for his illuminating lectures and Arthur Greenspoon for his careful editing of 
an earlier draft of this paper. The first author would like to thank Jim Bryan for first 
telling him about this duality. The second author would like to thank C-C M Liu and D 
Karp for fruitful discussions and suggestions related to the content of this work. Dave 
Auckly and Sergiy Koshkin were partially supported by NSF grant DMS-0204651. 

1 Mathematical history of Large N Duality 

It is instructive to trace the development of elements involved in the modern picture 
of Large N Duality. Hopefully this will also serve as a non-technical step-by-step 
introduction into the field. The reader should keep in mind that this is a mathematician's 
take on this task and a sporadic one at that. For instance, we almost completely ignore 
the physical undercurrent of the process except for a few landmark papers. A different 
perspective can be found in the introductory parts of Grassi-Rossi [67]. 

The history can be divided into four periods separated by physical breakthroughs into 
the mathematical realm: prehistory (1974-1989), formation of concepts and tools 
(1989-1998), the Gopakumar-Vafa conjecture (1998-2003), life after the vertex (2003- 
present). The dates in the text refer to arxiv submissions while references are given 
wherever possible to journal publications. 

This historical overview describes the state of the subject in 2003 before the seminal 
paper 'The topological vertex' [3] re-shifted the perspective. Accordingly in this essay 
we only address the developments in 1974-2003. New results and directions that 
appeared after the November 2003 workshop are surveyed (or rather sketched) in the 
last section of this paper. 

1.1 Prehistory (1974-1989) 

When 't Hooft first formulated the Large N Duality principle neither Chern-Simons 
theory nor Gromov-Witten theory existed. However, this period saw the appearance of 
many ingredients that later fit into the picture. 
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In 1983 H Clemens in a paper called 'Double Solids' (ie complex threefolds) [41] studied 
extremal transitions between threefolds that include deformations of complex structure 
into a singular one with subsequent resolution of the singularity. The conifold transition 
that connects cotangent bundles of 3-manifolds to their large N dual threefolds is an 
example of such a transition. The study of threefold singularities led to a conjecture 
known as Reid's fantasy (1987) that the moduli space of Calabi-Yau threefolds forms 
a single family related through such transitions (see Grassi and Rossi [67] and Reid 
[128]). 

At about the same time, physicists realized that string theories remain well-defined on 
varieties with certain singularities (see Dixon, Harvey, Vafa and Witten [49]) and can 
therefore change smoothly as the underlying manifolds undergo a singular transition. 
This idea that the same theory can be described on topologically different bases is at the 
heart of Large N Duality. 

On the other end, in 1985 V Jones introduced his polynomial invariant of knots [81]. 
However, his motivation came from operator algebras and no connection to Chem- 
Simons theory was known at the time. Already in 1985 several groups of authors 
generalized the Jones polynomial. Six authors published a joint paper [60] giving 
the new HOMFLY polynomial invariant (named with the initials of their last names). 
Two other authors [126] operated behind the iron curtain and their work remained 
unrecognized until somewhat later. To give full credit HOMFLY is now sometimes 
expanded to HOMFLYPT or THOMFLYP. It was later discovered that the Jones 
polynomial corresponds to the SU(2) and THOMFLYP to the SU(A0 Chern-Simons 
theories respectively. 

1.2 Formation of concepts and tools (1989-1998) 

This period starts with the appearance of the first [159] of three seminal papers by 
E Witten. These papers revolutionized both the Chem-Simons and the Gromov- 
Witten theories and then tied them together. In 'Quantum field theory and the Jones 
polynomial' Witten introduced the (quantum) Chern-Simons theory as a gauge theory 
on 3-manifolds with Lagrangian density given by the Chern-Simons form (see Witten 
[159] and Baez-Muniain [20]). He then 'solved' it, that is, found explicit mathematical 
expressions for expectations of observables by reducing the computation to conformal 
field theory on Riemann surfaces (see Di Francesco-Mathieu-Senechal [45] or Appendix 
E). The observables turned out to be the so-called 'Wilson loops', that is, holonomies of 
connections over knots and links in the manifold. Witten's heuristic computation showed 
that expectation values of Wilson loops are the Jones and THOMFLYP polynomials 
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with minor renormalizations. One can find a more mathematical account of Witten's 
ideas in M Atiyah's book [14]. 

Shortly after (1990-1991) Witten's results were put on a firm mathematical basis via 
quantum groups and conformal field theory. The quantum group approach was led 
by N Reshetikhin and V Turaev [129, 130, 84, 152], who redefined Witten's quantum 
invariants using the machinery of quantum groups and modular tensor categories (see 
Section 16). 

One first had to study perturbative expansions of the invariants rather than their exact 
expressions before the gauge/string duality would become apparent. Coefficients of such 
expansions represent another kind of knot invariants known as finite-type or Vassiliev 
invariants introduced in 1990 by V Vassiliev from completely different considerations. 
The connection between Chern-Simons theory and Vassiliev invariants along this line 
was established in the 1991 PhD thesis of D Bar-Natan (see also the subsequent papers 
[23, 24, 22]). The idea was to apply the the 'Feynman rules' from quantum field theory 
to the 'path integral' on the space of connections describing Chern-Simons theory (see 
Sections 14 and 15 for more details). Mathematically the universal finite type invariant 
now known as the Kontsevich integral was given in 1992 by M Kontsevich [91]. 

Similar developments also occurred in the field of 3 -manifolds. In 1991 S Axelrod 
and I Singer formalized the Feynman integral construction for perturbative Chern- 
Simons 3 -manifold invariants and in 1994 M Kontsevich demonstrated that his 
integral proved to be a universal finite-type invariant [89] . Finite-type invariants for 
3-manifolds were introduced by T Ohtsuki in 1996 [118] and in in 1998 T Le, J 
Murakami and T Ohtsuki gave a detailed construction of a Kontsevich-type invariant 
for 3-manifolds and proved its universality among the Ohtsuki invariants for integral 
homology spheres [94]. Now known as the LMO invariant it is believed to capture the 
trivial connection contribution to Witten's quantum invariant (see Rozansky [133] and 
Bar-Natan-Garoufalidis-Rozansky-Thurston [26]). 

Rapid progress on the complex side of the conjecture was initiated by Witten's 1991 
paper '2D Gravity and Intersection Theory on Moduli Space' [160]. In this paper 
Witten defines what are now called tautological classes on the moduli space of stable 
algebraic curves and gives string, dilaton and divisor equations that are sufficient to 
compute all of the corresponding intersection numbers (see Section 4, Hori et al [77] 
or Vakil [154]). These intersection numbers could be included in the framework of 
invariants of symplectic manifolds introduced in M Gromov's 1985 paper [69] on 
pseudoholomorphic curves (Gromov was more interested in topological applications). 

The next year M Kontsevich provided a proof of Witten's conjectured equations [88] 
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using a presentation of the moduli space by ribbon graphs and reducing the generating 
function for the intersection numbers to a matrix integral of Gaussian type [89] . This 
can be considered the first mathematical instance of Large N Duality The techniques 
of matrix models and graph combinatorics have become indispensable in computations 
related to Large ,/V Duality. 

In 1994 M Kontsevich generalized the definition of stable algebraic curves to stable maps 
[90] and paved the way to general definitions of (closed) Gromov-Witten invariants 
introduced by various authors in 1996 (see the discussion in McDuff-Salamon [109] 
or Cox-Katz [43]). Another important achievement of this paper is the discovery of 
the 'localization', technique for computing Gromov-Witten invariants (see Section 6). 
The basic idea dates back to the Atiyah-Bott paper [15] that shows how to compute an 
integral of functions equivariant under a torus action by localizing to an integral over 
the fixed points of the action. In 1997 this was generalized to 'virtual localization' by 
Graber and Pandharipande [66] . 

The third and final Witten paper that we ought to discuss 'Chern-Simons gauge theory 
as a string theory' was made available in 1992 although the first printed version appeared 
only in 1995 in the Floer Memorial Volume [161]. It contains an outline of the first 
step in the Large N Duality between the Chern-Simons gauge theory and a theory 
of open strings, so-called 'holomorphic instantons at infinity'. The second and more 
complicated step in 't Hooft's large N program that involves transition from open to 
closed strings had to wait until later. 

As described above, quantum invariants of knots and links correspond to expectation 
values in Chern-Simons theory. Witten took the next step and along with a 3-manifold 
M considered its cotangent bundle T*M with its natural symplectic structure. This 
allows one to define (pseudo-)holomorphic curves (stable maps) in T*M and 'count' 
them with Gromov-Witten invariants. For this idea to work it is important that the 
dual threefold is a Calabi-Yau so that holomorphic curves (stable maps) are generically 
isolated and can be counted. Holomorphic curves ending on the zero section of T*M are 
supposed to be the holomorphic instantons of Witten's theory. Witten argues moreover 
that the duality is exact, that is, it holds not only for large but for all N. The last property 
is due to the topological nature of the Chern-Simons theory. The partition function of 
this theory corresponds to the generating function of the Gromov-Witten invariants of 
holomorphic curves which is dual to the generating function of Chern-Simons invariants 
of knots and links. 

There is one major problem with this picture. A cotangent bundle is a Calabi-Yau 
but a very degenerate one, in particular there are no non-constant holomorphic curves 
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there either closed or ending on the zero section. This circumstance was well known 
to Witten and is explicitly pointed out in [161]. Recall that in 1992 there was no 
notion of Gromov-Witten invariants and even the modern 'translation' of holomorphic 
instantons as stable maps is incomplete. The physical notion is broader and includes 
objects like framed trivalent graphs, in particular framed knots and links in the zero 
section, a.k.a. instantons at infinity. One can think of them as made of infinitely thin 
ribbons thus representing degenerate Riemann surfaces with boundary. Even today we 
do not have a grand theory that would incorporate such degeneracies. Some potential 
contenders have emerged recently (for example symplectic field theory, see more in the 
last Section 20). In a way, one can view the theory of Chern-Simons link invariants 
as 'the Gromov-Witten theory' of cotangent bundles. Building up on this idea R 
Gopakumar and C Vafa completed the 't Hooft's program by finding in 1998 a dual 
theory of closed strings that turned out to be a 'true' Gromov-Witten theory but not on 
T*M. 



1.3 The Gopakumar-Vafa conjecture (1998-2003) 

Recall that string theories may change smoothly as the target space passes through 
some mild singular transitions. Gopakumar and Vafa conjectured that the dual theory 
of closed instantons lives on a threefold obtained from T*M via such a transition. They 
succeeded in finding the dual threefold for M = S 3 [65]. The corresponding transition 
known only to few algebraic geometers from Clemens' paper [41] was very explicitly 
described in 1990 by P Candelas and X De La Ossa [39]. It involves the zero section 
for T*S 3 shrinking to a nodal point and then getting resolved into an exceptional CP 1 . 
Their terminology of deformed and resolved conifolds and the schematic picture of the 
conifold transition have since migrated from one paper to the next. They also showed 
that all three actors in the conifold transition admit Calabi-Yau metrics and computed 
them explicitly. The manifold on the resolved side of the transition is just the sum 
0{— 1) © 0{— 1) of two tautological bundles over CP 1 . It is now called the resolved 
conifold even by mathematicians. 

Intuitively, as the zero section in T*S 3 collapses into the nodal point the 'open curves' 
that end on it close up and stay closed after the resolution of singularity. Thus, 
Gopakumar and Vafa conjectured that Witten's open string theory on T*S 3 turns into 
the usual Gromov-Witten theory of closed holomorphic curves on 0{— 1) © 0(— l),in 
short: 

_ Q large N Duality _ _ „ 

CS theory on S 3 GW theory on 0{- 1) © O(-l) 
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This was the original meaning of the Gopakumar-Vafa conjecture on the level of 
partition functions. 

Physicists make many claims like this; one reason this particular conjecture generated 
so much excitement in the mathematical community is that mathematical machinery 
was just mature enough to define both sides of the duality (if not the connection between 
them) in rigorous terms. 

Another reason is that Gopakumar and Vafa did not stop at a general physical claim but 
made two important and completely mathematical predictions. First, based on Witten's 
Chern-Simons computations they predicted an explicit form of the Gromov-Witten 
free energy function (and thus all closed Gromov-Witten invariants) for the resolved 
conifold. This can be compared to the prediction made by another physical duality, 
Mirror Symmetry, for the Gromov-Witten invariants of projective quintics (see Cox- 
Katz [43] and Hori et al [77]). Second, although Gromov-Witten invariants themselves 
are rational numbers (see Section 5) they can be represented as combinations of different 
numbers, later named the Gopakumar-Vafa invariants, that are integers. This integrality 
prediction later replaced the original meaning of 'the Gopakumar-Vafa conjecture' 
among algebraic geometers. The environment was so ripe that the prediction of the 
Gromov-Witten free energy was verified the same year by C Faber and R Pandharipande 
[55] by an explicit localization computation. The integrality conjecture for the resolved 
conifold was proved by A Okounkov and R Pandharipande in 2003 [119]. 

On the Chern-Simons side of the duality significant progress was made in computational 
techniques. In 1999 S Garoufalidis, D Bar-Natan, L Rozansky and D Thurston discovered 
an effective algorithm for computing the LMO invariant [26, 25]. It was called the 
Erhus integral (in honor of the city in Denmark where they started their work in 1995) 
and uses the graphical calculus of Bar-Natan and formal Gaussian integration. The 
same year R Lawrence and L Rozansky gave a representation for the SU(2) LMO 
invariant in terms of integrals and residues distinguishing contributions from different 
flat connections. This type of representation led to the discovery by Marino [105] of a 
relation between the LMO invariants and matrix integrals. 

The next year D Bar-Natan and R Lawrence used surgery and the Erhus integral to 
give an explicit formula for the LMO invariant of Seifert fibered homology spheres 
[27]. Contributions from non trivial flat connections for Seifert fibered spaces were 
determined in 2002 by M Marino using physical arguments [105]. Finally, in 2002 
combinatorial expressions for the Reshetikhin-Turaev version of Witten's invariants 
associated to an arbitrary compact Lie group were derived for Seifert fibered spaces by 
S Hansen and T Takata [71]. 
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The predictions in Gopakumar-Vafa [65] did not involve mathematical expectations 
of Chern-Simons observables, that is, quantum link invariants. This was rectified in 
a 2000 paper by H Ooguri and C Vafa [121], who conjectured that to each framed 
knot in S 3 there corresponds a Lagrangian submanifold in the resolved conifold and 
polynomial invariants of the knot give the generating function of open Gromov- 
Witten invariants. The latter 'count' open holomorphic curves with boundaries on the 
Lagrangian submanifold. For the case of the unknot Ooguri and Vafa gave an explicit 
construction of the corresponding Lagrangian submanifold in 0(— 1) © 0{— 1), and 
predicted the generating function of open invariants and their integral structure analogous 
to the Gopakumar-Vafa invariants in the closed case. A number of explicit computations 
for nontrivial knots followed on the Chern-Simons side (see Labastida-Marino-Vafa 
[93] and references therein). 

Unfortunately, the situation on the Gromov-Witten side did not develop as successfully. 
For one thing, unlike in the closed case the definition of the open Gromov-Witten 
invariants was lacking. Also, it was not clear how to generalize the Ooguri- Vafa 
construction of the Lagrangian to nontrivial knots. Nevertheless, in 2001 two groups 
of researchers succeeded in verifying the unknot predictions of Ooguri and Vafa. To 
compute the as-of-yet undefined invariants S Katz and C-C M Liu used the virtual 
localization technique as applied by Faber and Pandharipande in the closed case. T 
Li and Y S Song on the other hand avoided the use of open invariants altogether [96] 
by replacing them with relative Gromov-Witten invariants, the theory of which was 
developed by J Li at the same time [97]. A year later M Liu gave a rigorous definition 
of open invariants for the case when the Lagrangian submanifold is invariant under a 
torus action [99]. 

On the other front J M F Labastida, M Marino and C Vafa generalized the Ooguri- Vafa 
construction to all algebraic knots in 2000 [93] and in 2002 C H Taubes extended it 
even to all symmetric knots [149]. S Koshkin later gave a different construction for 
a Lagrangian associated to a knot that is valid for any knot [92] . By the time of the 
Banff workshop in 2003 the time seemed right for a general theory of open invariants to 
emerge and predictions of the Large N Duality to be verified. However, this did not 
happen. 

The difficulties proved to be much more significant than in the closed case: Lagrangian 
submanifolds for nontrivial knots do not admit convenient torus actions so the standard 
computational techniques do not apply; moreover, the available definition of invariants 
does not work in those cases. At the same time (2003) M Aganagic, A Klemm, M Marino 
and C Vafa discovered that all closed Gromov-Witten invariants of toric Calabi-Yau 
threefolds could be computed by 'slicing' them along Lagrangians corresponding to 
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framed unknots. Their algorithm known as 'the topological vertex' [3] captured the 
attention of mathematicians and shifted the focus away from general knots. Shortly 
after, the topological vertex was restated in a rigorous form by J Li, C-C M Liu, K Liu 
and J Zhou [95] using relative invariants and thus eliminating a need for open ones 
altogether as far as toric varieties are concerned. 

Thus 2003 closes the Gopakumar-Vafa period of research and we conclude our historical 
excursion. Progress made after 2003 went in a different direction and we shall give 
some indications about this in the last section of this paper. 

2 Overview 

In this section we introduce some minimal notation to explain in a nutshell what the 
rest of this fat paper is all about. The Gopakumar-Vafa Large N Duality [65] is a 
correspondence between two theories, one defined on S 3 and the other on 0{— 1) © 
0(— 1). Here 0(— 1) is the tautological line bundle over CP 1 defined by 



The projective line CP 1 is the collection of complex lines through the origin in C 2 and 
the 0{— 1) bundle is simply the collection of pairs consisting of a line together with a 
point on that line (hence the name 'tautological line bundle'). The projection just maps 
each point to the corresponding line. The number — 1 refers to the fact that the first 
Chern class of this bundle evaluates to — 1 on the CP 1 cycle. 

Intuitively, the large N correspondence can be traced to a geometric relation between 
the two spaces called the conifold transition T*S 3 ~~» 0{— 1) © 0{— 1). To understand 
this transition consider T*S 3 ~ TS 3 realized as SL2C in C 4 ~End(C ): 



Exercise 2.1 Consider the standard embedding S 3 w M 4 as the unit sphere inducing 
the embedding TS 3 <— > M. 4 x IR 4 ~ C 4 and find an explicit automorphism C 4 — > C 4 
that restricts to a diffeomorphism TS 3 — ► SL2C and S 3 — ► SU(2) (see Koshkin [92]). 

Now set det(W) = \i in equation (1). Taking \i — ► produces a complex deformation 
of T*S 3 into a singular variety that physicists and mathematicians call the conifold (it 




(1) 
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has an ordinary double point at the origin and is a higher dimensional analog of the 
usual double cone as in Figure 2.1): 



X 



s 3 ■= 



W\ W2 
VV3 W4 



£ End(C 2 ) 



det(W) = }. 



The conifold admits a small resolution X S i — > X S 3 of the singularity (see Harris [73]) 




Figure 2.1: The 'conifold' transition two dimensions down S x 



1 X.TB1...C0 



S u xl 



small meaning that the exceptional locus p 1 (0) is a curve rather than a surface as in 
the case of a blow-up. This resolution is the resolved conifold. 



Definition 2.2 The resolved conifold is 



X s i = < ([zi :z2],wi,W2,w 3 ,w 4 ) G CP 1 x C 4 
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The resolved conifold X S 3 is easily seen to be biholomorphic to 0(— 1) © 0(— 1). 

Geometrically the deformation shrinks the zero section (ie 5 ) into the double point and 
the resolution replaces it with an exceptional CP 1 so topologically we have the surgery 
S 3 x M 3 -w R 4 x S 2 . A low-dimensional analog of the conifold transition is shown in 
Figure 2.1. 

The resolved conifold X S 3 is a Kahler manifold and one can show that the expected 
dimension of the space of stable maps (holomorphic curves) from a Riemann surface 
into X S 3 is zero. Thus such maps are (formally) isolated and it makes sense to count 
them. The Gromov-Witten invariant N^^X^) is intuitively the number of maps of 
genus g surfaces representing the homology class JfCP 1 ]. The actual numbers can be 
fractional because one must assign fractional weights to curves with automorphisms. 
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Invariants defined in this way can be conveniently collected into the full Gromov-Witten 
free energy (see Part I for details): 

oo oo 
g=0 d=0 

On the other side, the unnormalized Chern-Simons free energy is the logarithm of its 
partition function := lnZ cs (S 3 ). Again intuitively, the latter is the value of the 
path integral: 

Z CS (S 3 ) = / e iCS(A) VA, 
J A 

taken over the space of connections on a trivial SU(A0 bundle over S 3 . In this formula 

CS{A) := — / Tr(A A dA + lA A A A A) 
4vr J M 

is the SU(N) Chern-Simons action, where k is an arbitrary integer called the level. 
Thus the unnormalized Chern-Simons free energy is a function of two parameters k, N 
encoded as N and x := just as the full Gromov-Witten free energy is a function of 
t,y. 

Today there is a multitude of rigorous constructions that define invariants Z CS (M) for 
any 3-manifold M that have the properties that one would conjecture based on heuristic 
path integral manipulations. Most of them use the fact due to Lickorish and Kirby that 
any 3-manifold M can be presented as a surgery on a framed link Lm and two different 
links present the same 3-manifold if and only if they are related by a sequence of the 
so-called Kirby moves (see Prasolov-Sossinsky [125] and Turaev [152]). Therefore, 
if one can come up with a framed link invariant that does not change under the Kirby 
moves one gets a 3-manifold invariant. N Reshetikhin and V Turaev were the first ones 
to come up with a systematic procedure for constructing Kirby-move invariant link 
invariants. Their invariants were based on the theory of quantum groups [129, 130]. We 
use a version of their construction in Sections 16.7 and 17. The corresponding invariant 
which is just the THOMFLYP polynomial at roots of unity can also be constructed in a 
number of other ways: skeins, TQFT, etc that all lead to the same quantity identified 
with the above partition function. We briefly touch on the skein and TQFT approaches 
in Appendices C and E respectively. 

We now give an intuitive idea how the Gopakumar-Vafa duality arises from the physics 
of string theory (we are grateful to M Marino for explaining this to us). As mentioned in 
the history section physicists work with a very broad notion of 'holomorphic instantons' 
described by a topological version of string theory known as the 'topological A-model'. 
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Holomorphic instantons live in Calabi-Yau threefolds and can be closed or open, that 
is, have boundary. In the latter case their boundary lies on 'D-branes', located at 
('wrapped around') special Lagrangian submanifolds of the threefold. Open and closed 
holomorphic curves and stable maps are examples of holomorphic instantons but there 
are more degenerate ones as well, for example 'instantons at infinity' (see Witten 
[161]) - trivalent ribbon graphs in Lagrangian submanifolds representing infinitely thin 
'curves with boundary'. Physical quantities produced by the theory are called 'string 
amplitudes' and in good cases they can be identified with Gromov-Witten invariants 
of the threefolds. The Calabi-Yau condition is needed to make sure that holomorphic 
instantons are isolated and can be 'counted' with finite amplitudes. 

Now consider two extreme cases of this picture. The first case is when there are no 
'honest' holomorphic curves as in cotangent bundles T*M to 3-manifolds since the 
symplectic form on them is exact. At the same time, the zero section is a special 
Lagrangian submanifold and knotted trivalent graphs in M (framed knots and links in the 
simplest case) can be seen as degenerate instantons with boundary. Witten discovered 
in [161] that the string amplitudes of a cotangent bundle with 'N D-branes wrapped 
around the zero section' can be recovered from the quantum S\J(N) Chern-Simons 
invariants of M computable via the surgery prescription from his earlier paper [159]. 
Moreover, one can also consider instantons ending on conormal bundles to links in 
M that are also special Lagrangian in T*M. This time the string amplitudes coincide 
with the the quantum SU(A0 link invariants. In other words, the topological A-model 
reduces to the quantum Chern-Simons theory in this case and can be viewed as 'the 
Gromov-Witten theory' of cotangent bundles. K Fukaya gives this idea a more precise 
meaning in terms of Floer homology in [61]. 

In the second case there are no D-branes in the picture and the only holomorphic 
instantons that remain are closed stable maps. This is the case of the resolved conifold 
and its usual Gromov-Witten theory. A striking feature of string theory discovered by 
Dixon, Harvey, Vafa and Witten [49] is that physically equivalent models can be set 
in different 'geometric backgrounds', that is, live on different threefolds. This occurs 
when the geometric backgrounds are related by special geometric transitions. Whereas 
physical quantities do not change, the underlying threefold may undergo a singular 
transition as some of the D-brains and/or holomorphic homology classes collapse or 
appear. The conifold transition is the simplest example of such a geometric transition. 

Open instantons that end on the zero section close up as they shrink to the nodal point 
and transform into closed holomorphic curves in the resolved conifold. Unlike the 
zero section conormal bundles to links do not collapse and reappear as Lagrangian 
submanifolds in the resolved conifold. Instantons at infinity that ended on them therefore 
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transform into open holomorphic curves. String amplitudes computed on both sides of 
the transition should be the same since the physics does not change. In a nutshell, this 
is the insight behind the Gopakumar, Ooguri and Vafa predictions of equality between 
the Chern-Simons 3-manifold and link invariants on one side and closed and open 
Gromov-Witten invariants on the other (see Gopakumar- Vafa [65] and Ooguri- Vafa 
[121]). 

The name Large N Duality comes from the specific way string amplitudes for instantons 
at infinity are recovered from the Chern-Simons invariants. One needs to consider the 
latter not for a specific rank but for all ranks N. The resulting function turns out to 
be analytic in i around modulo some logarithmic terms and can be expanded into 
a Laurent series. The coefficients of this series are the string amplitudes in question. 
From the perspective of knot theory this means that string amplitudes are given not by 
the 'exact' invariants (such as Jones or THOMFLYP polynomials) but by the so-called 
Vassiliev (or finite-type or perturbative) invariants [22, 25]. One should not be misled by 
the name into believing that the duality holds at large ,/V only. The Laurent coefficients 
match the Gromov-Witten invariants of the resolved conifold at all powers of 4 and 
the duality is exact. 

One of the most attractive traits of the Large ,/V Duality is its computational power. It is 
usually much easier to compute gauge theory quantities (partition functions, correlators, 
etc) than the corresponding string amplitudes. This is due to the apparatus of informal 
but effective path integral manipulations successfully applied by physicists for quite 
some time. String theory techniques (such as equivariant localization) are more recent 
and much more cumbersome. It turns out that contributions from instantons at infinity 
can be reduced to path integrals and even those contributions of honest holomorphic 
curves can be represented by path integrals with extra insertions. Thus the computational 
machinery of the gauge theory becomes available for string theories as well. One 
remarkable achievement of this approach is the topological vertex algorithm originally 
derived from the Chern-Simons path integral. This algorithm computes Gromov-Witten 
invariants of all toric Calabi-Yau threefolds (see Aganagic-Klemm-Marino-Vafa [3] 
and Marino [104]). 

At present we are very far removed from a mathematical definition of the topological 
A-model in anywhere near the generality used by physicists. However, the prediction 
of the equality of the Gromov-Witten and Chern-Simons free energies or partition 
functions on X S 3 and S 3 is a well-defined mathematical statement. This equality should 
not be taken too literally, one has to renormalize and change variables to make it work. 
But it is true that one function can be recovered from the other as Gopakumar and Vafa 



Qeometry & Topology Monographs 8 (2006) 



Introduction to the Gopakumar-Vafa Large N Duality 



215 



convincingly demonstrated by correctly predicting the values of the Gromov-Witten 
invariants of X S 3 in [65]. 

The computation that verifies the Gopakumar-Vafa predictions was originally done 
by C Faber and R Pandharipande in [55] but it does not cover the Chern-Simons side 
relying on the formulas obtained by path integral methods. Later papers [67, 104] that 
compute and compare both free energies skip many of the details to stay within a limited 
length. In this paper we provide the background material on both theories necessary to 
understand the structures behind these computations and reproduce the computations 
themselves (Section 19). The result of comparison can be packaged in the following 
form: 

Theorem 2.3 The full Gromov-Witten free energy and the unnormalized Chem- 
Simons free energy are related by 

Re(F$J(iNx,x) - F^ 3 s (N,x)) = ^ In* + ((3)x~ 2 - \ ln(2vr) - C'(-l). 

Some comments are in order about the form of this formula. First of all, even though the 
free energies are complex- valued the relevant coefficients in their expansions, that is, the 
invariants themselves are real so it suffices to consider only the real parts. Secondly, the 
free energies fail to be holomorphic in x at zero where the expansions are taken, but they 
do so in a very minor way. The terms on the right appear as a result of regularization 
and do not indicate any meaningful discrepancy. 

To appreciate how powerful this theorem is note that the full Gromov-Witten free energy 
encapsulates the Gromov-Witten invariants of the resolved conifold in all degrees 
and all genera. In its turn, the Chern-Simons partition function for the Hopf link 
contains the sl n C Vassiliev invariants of all knots and links in the three-sphere. It should 
come as no surprise that the duality for 'just' this one example led to computation of 
the Gromov-Witten invariants for all local Calabi-Yau threefolds [3, 104]. Despite 
its somewhat unappealing form this formula is a very strong confirmation of the 
Gopakumar-Vafa conjecture. We finish this introduction by stating a far-reaching 
generalization of Theorem 2.3 suggested by M Marino in his Banff lectures. 

Conjecture 2.4 For every rational homology 3 -sphere M there exists a large N dual 
Calabi-Yau threefold Xm such that the Chern-Simons theory on M is equivalent to 
the Gromov-Witten theory on Xm- In particular, the corresponding invariants can be 
recovered from each other. 

For a reader who may think that the task of learning so much algebraic geometry 
and quantum algebra is overwhelming we promise that learning these complex but 
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remarkable structures is well worth the effort. While navigating the deep waters of 
abstraction the reader should always keep in mind that we are merely computing two 
complex- valued functions - the free energies of X S 3 and S 3 . 
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Parti 



Gromov-Witten invariants 



The theory of Gromov-Witten invariants is the mathematical theory closest to string 
theory in physics. These invariants arise as generalizations of enumerative invariants. 
In this part, we will outline the definition of Gromov-Witten invariants and give some 
sample computations. 

The first ingredient in understanding these invariants is the cohomological interpretation 
of intersection theory. As a simple example consider counting the number of zeros 
of a degree d polynomial in C[x] , say p(x). The answer is easier when we are using 
complex coefficients. In more general counting problems the answers will be more 
uniform if we work in complex projective spaces. Given a degree d polynomial p 
we can define a function,^,: CP 1 — > CP 1 given by f p ([z : w]) = [p(z/w)w d : w d ]. 
This induces a map on the second cohomology, f* : tf^CP 1 ; Z) // 2 (CP ! ; Z) . Since 
H 2 (CP ;Z) = Z, this map is just multiplication by some integer. This integer is known 
as the degree of the map and it coincides with the degree of the original polynomial. 
We can write this as 



Here # represents a signed count of a set of points in general position. We will later 
describe methods to address non-generic situations. The integral represents the cap 
product pairing of homology and cohomology, so [CP 1 ] is the fundamental homology 
cycle (CP 1 has a natural orientation coming from the complex structure) and w CP i 
is the orientation class. Using the de Rham model for cohomology, the integral will 
become an honest integral. 

Part I provides a correspondence between geometric intersections and cohomological 
operations. We will describe various lines in this table as we use them. General 
topological folk wisdom suggests thinking via intersections and proving via cohomology. 

We now turn to a more serious motivating question: how many lines pass through two 
generic points in a plane? More generally, how many degree d parameterized curves 
pass through the 'right' number of points in a plane modulo reparameterization of the 
domain? We may describe the space of lines with two marked points in the plane as 



M , 2 (CP 2 ,1[CP 1 ]) = 

{u: CP 1 -» CP 2 , pi,p 2 | mJCP 1 ] = UCP 1 ],^ = 0,pi ^p 2 £ CP 1 }/ ~ . 
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Intersections 


Cohomology 


A codimension k homology submanifold, A 


A cohomology class a e H k (M; Z) 


#(A n F) 


a([F]) = a n [F] = J [F] a 


AC\B 


aU/3 = a/\/3 


r l (A) 


f*a 




ci(L) ore(£) € ff r (X) 




Thorn class $ <E H r (E, E - a (X)) 



Table 2. 1 : Geometric intersections vs cohomology 



Maps in this set are explicitly given by u([z '■ w]) = [az + bw : cz + dw : ez +fov] . Now 
count the dimension of this space. There are 6 complex parameters in the definition 
of our degree one parameterized curve. However the points in the projective plane 
are only defined up to a scale, so we subtract one parameter. We wish to count two 
maps as equivalent if they are related by a reparameterization of the domain (this 
is what the ~ represents in the equation for Mq ( 2(CP , lfCP 1 ])). The holomorphic 
isomorphisms (reparameterizations) of CP 1 are just the linear fractional transformations. 
More explicitly, we define (u,p\,p2) ~ (v, q\-,qi) to hold if and only if there is a linear 
fractional transformation, say ip, such that u = v o p and p(pk) = qk- A similar count 
allows one to conclude that the space of linear fractional transformations has complex 
dimension 3 . It follows that the space of complex projective lines in the complex 
projective plane has complex dimension two. Adding two points in the domain adds 
two more complex parameters, so the complex dimension of Mo^CP 2 , lfCP 1 ]) is four. 

Now we have two natural evaluation maps taking Mo,2(CP 2 , lfCP 1 ]) to CP 2 , given by 
evj.([M,pi,/?2]) := u(pk). Since CP 2 is two complex-dimensional the following integral 
makes sense and represents the number of lines passing through two points: 

/ ev\uj£p2 A ev2o; CP 2 

./[Mo^CPMlCP 1 ])] 

There is an infinite number of lines passing through one fixed point, and there are no 
lines passing through three generic points in the plane, thus two is the 'right' number of 
points to mark when counting lines. 

Exercise 2.5 Count the dimension of the space of degree d genus zero parameterized 
curves into CP 2 modulo reparameterization of the domain. Write an expression similar 
to the integral above representing the number of such curves through the 'right' number 
of points. We will outline two different ways to compute these numbers later in Part I. 

In the next section we will define the Gromov-Witten invariant (71, . . . ,7n)?fl The 
intuitive interpretation of (71 , . . . , In)* p is the number of genus g curves in the class 
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(3 that intersect the cycles T\, . . . ,T n Poincare dual to 71, . . . ,7„. In general, this 
interpretation fails. In fact, (71 , . . . , 7n)g ^ are only rational numbers, not integers. This 
is because curves must be counted with fracitonal weights to get a correct definition. 

3 The coarse moduli space 

Gromov-Witten invariants extend the ideas described in the above problem. These 
invariants may be defined for symplectic manifolds or for projective algebraic varieties. 
We mostly use the symplectic definition in this section, but give some idea of the 
algebraic definition at the end. 

3.1 The symplectic construction 

Recall that a symplectic manifold is a (real) 2n-dimensional manifold with a 2- 
form uj such that duj = and (nl)~ l uj An is a volume form on X. Any symplectic 
manifold admits a compatible almost complex structure. An almost complex structure 
/ € r(End(7X)) is an endomorphism of the tangent bundle which squares to negative 
one J 2 = —I . Such is compatible with a symplectic form, say uj , if the tensor defined by 
g(X, Y) = u(X, JY) is a Riemannian metric. Any two of g, uj or / uniquely determine 
the third via the compatibility condition. The standard symplectic structure on C is 
given by uj = ^dz A dz, and the standard symplectic structure on CP 1 is given by 

i(dz A dz, + dw A dw) 
U = 2(\z\ 2 + \w\ 2 ) 2 ' 

Together with the standard complex structure this produces a round metric of radius of 
2 on the Riemann sphere CP 1 . There are similar symplectic structures on all complex 
projective spaces. The standard complex structure on C considered as a real vector 
space is just multiplication by i. In the basis {1, i} it is given by the matrix 

-(:-:)■ 

In tensor notation it is given by 

/ = d y ® dx — d x ® dy = id z ® dz — id- z ® dz,- 

In a local chart the complex structure on any complex manifold takes this form 
(generalized in the obvious way to C"). The product of two symplectic manifolds, 
(X/c, ujk), k = 1 , 2, is the symplectic manifold (Xi x X2,p\uJ\ + p^i) where pu are the 
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natural projections. Thus CP 1 x C 4 inherits a natural symplectic structure as a product. 
This leads to a symplectic structure on our main example, the resolved conifold 

X S 3 := |[zi,Z2],(wi,W2,W3,w 4 ) G CP 1 x C 4 

This is our main example because it will turn out to be the large N dual of S 3 . We will 
describe the corresponding moduli space in Section 9, compute the Gromov-Witten 
invariants in Section 1 1 and see that this is the large N dual of S 3 in Section 19. 

The notion of a holomorphic curve can be generalized from algebraic varieties to 
symplectic manifolds. A symplectic manifold with a compatible almost complex 
structure provides exactly the data needed for the target of a pseudoholomorphic curve. 
A Riemann surface E has an associated almost complex structure which is automatically 
a complex structure denoted by j. The Cauchy-Riemann operator is defined on a map 
u: E -» X by 

du=\ (du + J u o du o j) . 
By definition, a map u is pseudoholomorphic when du = 0. 

Exercise 3.1 Write out the Cauchy-Riemann operator on maps / : C —> C using x-y 
coordinates on the first factor and u-v coordinates on the second assuming the natural 
symplectic and almost complex structures. 

Definition 3.2 A smooth, genus g, n -marked, pseudoholomorphic curve in X is a 
tuple (Ej, u,pi , . . . ,p n ) where E is an oriented genus g surface, j is an almost complex 
structure on E , u is a pseudoholomorphic map u : E — ► X and G E are distinct. 
A morphism between (E J, u,p\, . . . ,p n ) and (E',/, u',p[ , . . . ,p' n ) is a holomorphic 
map, ip: E — > E', such that u' o tp = u and <p(j?k) = p'u- A genus g, ra-marked, 
pseudoholomorphic curve in X will be called stable if it has a finite automorphism 
group. 

We can now define the (coarse) moduli space of genus g, w-marked, stable pseudo- 
holomorphic curves in a homology class /? G H2(X; Z) to be the set of equivalence 
classes of such, 

M g>n (X,f3) = {[EJ, u, pi,... ,/7„]|«*[E] = 0)1 ~ . 
There are natural evaluation maps, ev^ : M gt „(X, (5) — > X given by 

ev*([Ej,K,pi, . . . ,/?„]) = u(pk). 







W2 




W3 


W4 


- 




Zl 


^2 




Zl 


Z2 



Qeometry & Topology Monographs 8 (2006) 



Introduction to the Gopakumar-Vafa Large N Duality 



221 



Using these, we have our first definition of the Gromov-Witten invariants. Let 
71 , . . . , 7„ E H*(X; Q), (3 E H 2 (X; Z) and define the Gromov-Witten invariants by 



You will notice that there are many notations in this definition that we have not defined 
yet. We will slowly compute the Gromov-Witten invariants of CP 2 (hence how many 
cubic parameterized curves pass through 8 points etc) and the Gromov-Witten invariants 
of X s i . Along the way, we will define the extra notations used in the above formula. For 
now when you see [M. gj n(X, /?)] vu you should just think M g: „(X, j3). We will see that 
the overline refers to a compactification of this space later in this article. The calligraphic 
font refers to the stack structure on the moduli space. Intuitively the stack structure 
'adds' a group to each point of the space in order to have a proper count of points 
taking symmetry into account. The necessity of stacks is motivated in the first article 
in Section 5, and the definition of the moduli stack M. g n QC, /?) is given in the second 
article of this subsection. The general definition of a stack is given in Appendix A, and 
the best example is contained in Section 10. The square brackets with vir superscript 
indicate the virtual fundamental class. This is motivated and defined in Section 9. The 
short explanation is that when intersections are counted one generally assumes that 
objects are in general position. However, one can still get sensible answers when the 
objects are not in general position provided one uses the correct virtual fundamental 
classes. 

To make sense of the integral in the definition we need to have a fundamental cycle to 
integrate over. This is easiest to establish when the domain of integration is compact. 
As defined above the coarse moduli spaces would not be compact because we insist that 
the marked points be distinct. Even without considering marked points these spaces will 
fail to be compact. To see the problem, consider the family of degree two parameterized 
curves u n : CP 1 -> CP 2 given by u n ([s :?]):= [n~ l (s 2 + t 2 ) : 2rT l st : s 2 - t 2 ] . The 
limit appears to be given by Uoo([s '■ ?]) = [0 : : s 2 — t 2 ] , but this is not a well defined 
map (consider points where s = ±f )■ The image of the map u n in an affine chart is just 
the hyperbola, x 2 — y 2 = rT 2 . The geometric limit of this sequence is just a pair of 
lines. This is a clue that motivates the correct definition of a compactification of the 
moduli space. The correct limit is a map from the one point union of two copies of the 
complex projective line. The domains and (real part of the) images of u„ and the limit 
Moo are displayed in Figure 3.1. To be precise the limit is defined by 

Moo : CP 1 x {±1} CP 2 ; Moo([s :t],e):= [s:es :t]; e = ±1. 

The splitting off of an extra component in the limit is called bubbling. 
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Figure 3.1: Bubbling 



To describe domains of stable maps we take a disjoint union of Riemann surfaces 
(called the normalization) and glue them together along special points called nodes. 
The resulting 'surface' needs to be connected. These objects are called prestable 
curves or nodal Riemann surfaces. The original surface without any identifications is 
called the normalization. The nodes are locally modeled on {(x,y)|xy = 0}. There is 
also a notion of a smoothing of a prestable curve obtained by replacing each node by 
{(x, y)\xy = e} . An example of a marked prestable curve together with its normalization 
and smoothing is shown in Figure 3.2. Precise definitions are given in the paper 




Figure 3.2: Prestable curve, normalization and smoothing 



by Siebert on Gromov-Witten invariants for general symplectic manifolds [142]. In 
fact, this is a superb reference for the definition of Gromov-Witten invariants in the 
symplectic category for readers who are more familiar with differential topology than 
algebraic geometry. While we are on the subject of references, we should mention the 
book by Hori et al [77], the book by Cox and Katz [43] and the little-known book edited 
by Aluffi [8]. These are the main references from which this section on Gromov-Witten 
invariants was derived. For more information see these sources and the references 
contained therein. 
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What follows is a more formal definition of a marked prestable curve. 

Definition 3.3 (Siebert [142]) A marked prestable curve is a pair, (E,p) where S is 
a reduced, compact, connected, one-dimensional, complex, projective variety with no 
worse than ordinary double point singularities, and p is an rc-tuple of pairwise distinct 
regular points. 

An ordinary double point is a point modeled on {(x, y)\xy = 0} (see Griffiths-Harris 
[68] and Hartshorne [75]). Reduced means that there is no locally defined non-zero 
holomorphic function with a power equal to zero. To see how such a function could 
arise consider the variety defined by x 2 y = 0. This looks like a one point union of 
a pair of lines, but it is not a reduced variety because xy is a non-zero holomorphic 
function with square zero. The one point union of a pair of lines is represented by the 
reduced variety defined by xy = 0. 

A prestable map is a holomorphic map from a prestable curve to a symplectic manifold 
or projective variety. The definition of a morphism of marked pseudoholomorphic 
curves, and resulting notions of equivalence, stability and automorphism group extend 
naturally to prestable curves. This allows one to define a compactification of the moduli 
space as follows. 

Definition 3.4 The (compactified) coarse moduli space of genus g, n -marked curves 
is 

M g , n (X,(3) = 

{[£,/?, u] | m*[E] = and (S,p) is a genus g, ^-marked prestable curve 

such that [S,p, u] is stable}/ ~ . 

It may seem weird that this space is compact when it is required that the marked points 
be disjoint from each other and the nodes. The limit of a sequence where two or more 
marked points or nodes collide will be described in the discussion of the boundary 
divisors and tp classes in Section 4 on the cohomology of the moduli space. Basically, 
the idea is that one or more new components bubble off at the point of collision and the 
marked points move into these bubbles while staying distinct. Before we talk about 
a compactification, we should introduce a topology on the moduli space. We will 
conclude this subsection with a description of the topology used in the symplectic 
category. 

In the symplectic case a topology on the space of stable maps can be described via 
Gromov convergence . The main difficulty in defining it is that the limit map may have 
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a different domain (as we saw in the example depicted in Figure 3.1). To circumvent 
this difficulty define a resolution k : £ — > £ between two prestable curves to be a map 
satisfying: 

(1) If p G £ is a node, then n(p) is a node; 

(2) If q G £ is a node, then is either a node or a circle disjoint from nodes; 

(3) If V is any neighborhood of all nodes in £, then k is a diffeomorphism 
onto its image. 

One says that a sequence of stable maps (S^, uu) Gromov converges to a map (£, m) if 
there is a sequence of resolutions Kk : £& — > £ such that for any neighborhood V of all 
nodes in £ : 

(!) u k ° ^ -p^" « in C°°(£\V); 

(2) dnkojkodK^ 1 r——> j in C°°(£\V), where are complex structures on 
and S respectively; 

(3) Area(M fc (S fc )) ^ Area(«(E)) . 

Equipped with the Gromov topology the moduli space is Hausdorff and compact. One 
can check that the image homology class (3 G H2(X, Z) and the (arithmetic) genus are 
preserved in this limit. With this topology the coarse moduli space has the structure of 
a generalization of a manifold called an orbifold. See Siebert [142] and the references 
contained therein for more details. 

Remark 3.5 For genus zero invariants of convex spaces such as CP" the moduli space 
is in fact a manifold. We will discuss the moduli space as if it were a manifold until 
Section 5 where we explain where the orbifold singularities arise. 

3.2 The algebraic construction 

The construction of the coarse moduli space for projective varieties works a bit differently. 
One can start working from the ground up by studying some examples. The easiest 
example has X equal to a point (so (3 = 0) and the genus equal to zero. This amounts 
to studying configurations of n distinct points in CP 1 modulo equivalence by degree 
one rational maps. Any degree one holomorphic map from CP 1 — > CP 1 takes the form 

¥>(b : w]) = [az + bw : cz + dw] 
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where ad — be ^ . In affine coordinates on the domain and codomain this takes the 
form ip(z) = . It is easy to see that the following function for fixed zo, Zi and zi is 
a linear fractional transformation taking Z3 = Zo to 0, Z3 = Z\ to 1 and zs = Zi to oo. 

Definition 3.6 The cross ratio is the function defined by 

, , (zi - Z2)fe - zo) 

j(Z0,Zl,Z2,Z3) ■= ~ ~ r. 

(ZO -Z\)(Z2 -Z3) 

It is easy to say what the cross ratio means - it is nothing more than the image of the 
fourth point under the unique linear fractional transformation taking the first three points 
to 0, 1, and 00 respectively. We now follow the exposition of D Salamon which utilizes 
cross ratios to realize Mo, n (pt, 0) as projective algebraic varieties (see Salamon [134]). 
For each tuple (ij, k, 1) of distinct positive integers less than or equal to n define a 
function, 7^-^ : M 0>n (pt,0) CP 1 by 7, w ([pl) := j(Pi,Pj,Pk,Pt)- One can easily 
check that these functions satisfy the relations 

lj,i,k,e = Hj,i,k = 1 - Hj,k,i, 

lij,k,£%kj,i — li,kj,£ = lij,k/i 
lj,k,£,mlij,k,m ~ 'Yj,k,£,mlij ! k,£ = Hij,k,m ~ 1- 

As a particular case, we see that the cross ratio maps Mo t 4 isomorphically to CP 1 — 
{0, 1, oo}. It is natural to guess that this extends to a bijection between Moa and 
CP , which in fact it does. The stable maps corresponding to 1 and oo are displayed 
on the left of Figure 4.4 in the next subsection. More generally, the cross ratios of 
the marked points may be used to identify Mo,„ with the projective subvariety of 
(CP 1 ) xA ' specified by the solutions to the above displayed equations in the 7. Here 
Af = n(n — \){n — 2)(n — 3) is just the number of possible distinct 4-tuples marked 
points. In the following aside we continue with a very brief description of the algebraic 
construction of the coarse moduli spaces of higher genus curves as projective varieties. 

Aside 3.7 We can now step things up a bit and consider moduli of higher genus curves. Here 
our exposition follows that of D Mumford from [115]. Let S be agenus g > 1 curve and Ks 
be the canonical bundle (top exterior power of the cotangent bundle). Using the Riemann-Roch 
theorem one can compute that the dimension of H°(E, K^) is 5g — 5. The space //°(S, K^) is 
just the space of globally defined holomorphic forms of the form f{z)dz^ ■ Given a basis for 
H°(E,Z|), say {wj, define a map to CP 5g ~ 6 by <f>(z) = [w k (z)] ■ Here we use any trivialization 
of K^. around z to identify the oj/dz) with complex numbers. Changing the trivialization clearly 
does not change the projective equivalence class. The Weierstrass points of E are defined to 
be those points in E for which the tangent plane to E in CP 5? ~ 6 matches to order 5g — 5 
or more. There are g(5g — 5) 2 such points counted with multiplicity, label them by Zj ■ Now 
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take a large N and consider the following set of functions from 5g — 5 element subsets of 
E = { 1 , . . . , g(5g — 5) s } to the non-negative integers 

R = {r : {I C E\\I\ = 5g - 5} -► Z\r(J) > for all J and E^rfT) = 
Define an embedding M gfi -> CP |S| ~' by 



53 [] (det, e/ (^(z > ))) 

o-eperm(£) /CE,|/|=5g-5 



r(<x(/)) 



The determinants in the above expression lie in K^ N ; they can be interpreted as complex 
numbers by evaluation in any trivialization. 



Exercise 3.8 Assuming that this construction works, jazz it up to define an embedding of M, 
into a sufficiently large projective space. This is difficult, but luckily it is not needed. 



There are other approaches to proving that M g „ and M gn admit the structure of quasiprojective 
and projective varieties respectively, but no way is easy. See Mumford [113] for the standard 
exposition. 

The next step is to describe the structure of M g ,„(CP r , d) . The final step is to define M gy „(X, (3) for 
general projective varieties X. These last two steps are not so bad. Our exposition comes from the 

lectures by Aluffi [8]. Given 6 M gJ1+ d {r+1 ) such that the divisors (p„+irf+H \-p n +kd+d) 

and {p„+£d+\ + ■ ■ -+p n +£d+d) are linearly equivalent for k, I = 0, . . . , r and non-zero sections 
of the line bundle associated to these equivalent divisors such that Sk(p n +u+\) = Sk(Pn+kd+d) = 
one can associate a stable curve [u,T,,q] 6 M g ^ n (CP r ,d) by qj = pj for j = 1, . . . ,n and 
u(z) — [sk(z)]- It is not hard to see that two pairs and ([£' ,p'],s') produce the 

same stable map if and only if s and s' agree up to a constant factor and [X,/?] and [£' ,p'] 
agree after a permutation in the marked points fixing pj for j = 1 , . . . , n and each divisor 
(Pn+kd+i + ■ ■ ■ Pn+kd+d)- The subset of M g ^ + d( r +\) satisfying the equivalent divisor condition 
is a subvariety, and the set of data that we have described here forms a (C x ) r+1 bundle over 
this subvariety. The quotient of this bundle by the group generated by the change of scale and 
permutations produces a quasiprojective variety that embeds into M g „(CP' , d) as an open set. 
Of course we can embed it in a different way by composing each map with a fixed holomorphic 
isomorphism of CP' . The fact is that by choosing a finite number of such isomorphisms we 
can completely cover M ? ,„(CP r , d). To see this, pick a basis, {f^}, for //°(CP r , C(— 1)). Then 
to any generic [w,!],*/] G M g .„(CP' ,d) we associate ([£,/?], o u), where p is obtained by 
adjoining the zeros of the o u to q. 
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4 Cohomology of the moduli space 

We see that our first definition of Gromov-Witten invariants is just the evaluation of 
natural cohomology classes on the moduli space. This leads one to ask if there are 
any other cohomology classes on the moduli space. Indeed there are other interesting 
classes. In this subsection we will define some of them and derive several important 
recurrence relations between the Gromov-Witten invariants. The definition of a new 
set of cohomology classes appears in the first article and the recurrence relations are 
described in subsequent ones. The paper by R Vakil [154] is also a good reference for 
this material. 

Here we describe the new cohomology classes as Chern classes of natural vector bundles 
over the moduli space. In general, as explained in Section 5 one needs a generalization 
of vector bundles called orbibundles that allow finite quotient singularities. However, in 
a number of examples singularities do not appear (Remark 3.5). For now we assume 
that everything is smooth and introduce more general examples in Section 5. 

4.1 Gromov-Witten invariants and descendants 

We recall the intersection theory definition of the first Chern class of a line bundle. 
There are many other possible definitions, see Milnor-Stasheff [1 1 1], Griffiths-Harris 
[68] and Bott-Tu [30]. Given a line bundle L — > X and two generic (transverse) 
sections <jq, o~\ \ X — > L, the first Chem class of the line bundle may be defined to 
be the cohomology class Poincare dual to crj~ (ao(X)). As an example, consider the 
tangent bundle to the 2-sphere. A section of the tangent bundle is nothing other than 
a vector field. We can (and generally will) take ao to be the zero section. We can 
take o\ to be a vector field that flows up from the south pole to the north pole. As a 
set crj~ (cto(S 2 )) consists of exactly two points. One can see that the intersections are 
transverse, and conclude that a(TS )[S ] must be —2, or 2. It is in fact 2. 

Exercise 4.1 Write out consistent orientation conventions and verify that the signed 
count of zeros implies that ci(r»S 2 )[»S 2 ] = 2. 

Exercise 4.2 Prove the following properties of line bundles and their Chern classes. 

(1) ci(C) = (Here and elsewhere V will denote the trivial bundle with fiber V.) 

(2) c 1 (L l ®L 2 ) = c l (L 1 ) + c 1 (L 2 ). 

(3) c 1 (f*L)=f*c l (L). 
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(4) L®L* ^ C. 

There are n distinguished line bundles defined over M g:fl (X,f3), denoted by for 
k = 1 , . . . , n . Intuitively, these bundles are specified by an identification of the fiber 
over each point as 



This allows one to define new cohomology classes on M gin (X,f3) and extend the 
definition of the Gromov-Witten invariants. 

Definition 4.3 The ^-classes are defined by ip^ := c\(Cu) £ H 2 (M gin (X, /?)). The 
descendant Gromov-Witten invariants are defined by 



Here 71 , . . . , 7,, 6 H*(X; Q), f3 e H2QC; Z) and the integral of a form over a space is 
defined to be zero if the degree of the form does not match the dimension of the space. 

It is often possible to reduce the computation of Gromov-Witten invariants on one 
space to a computation of descendant invariants on a smaller space. In addition there 
are recursion relations relating various descendant invariants. 

We should now make the definition of the bundle Ck more precise. To do so, we need to 
study the relationship between moduli spaces with different numbers of marked points. 
There is a natural projection from M g>n +\(X, (3) to M g>n (X, (3) given by ignoring the final 
point. One subtle point is that after deleting a marked point a stable map with marked 
points may no longer be stable. This can be fixed by stabilization. The stabilization 
of a prestable map, st([M, £,/?]) is defined by identifying to a point any component of 
the normalization of £ on which an infinite subgroup of the automorphism group acts 
effectively (that is, only the identity element fixes everything). This gives, 

7r: M g7 n+i(X,/3) -> M g! n(X,P); ir([u, . . . ,p n +i]) ■= st([u, £,/?i, . . . ,/?„]) . 

See Figure 4.1 to see the result of projection with nontrivial stabilization. 

There are also inclusion maps going in the other direction defined as follows: 



where u is defined by m|s = u and w| CP i = u(pk)- See Figure 4.2 to see the result of 
inclusion at a marked point. 





p k : M g , n (X,P)^M g}n+l (X,P); 
p k ([u, E,pi, . . . ,p n \) := 

[«, £ U w=[0 :i] CP 1 ,/?!, . . . ,/?£_!,[! : U,Pk+i, ■ ■ ■ ,P. 



[1 : 0]] 
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Figure 4.1: Projection from M23 to M^^ 



2 4 




1 1 

Figure 4.2: Inclusion of M2.3 in Mi t \ at p2 



As an aside we can use these last two figures to explain the limit of a sequence in which 
one marked point approaches a node or approaches a different marked point. If a third 
marked point is added to one of the genus zero components of the curve on the right 
side of Figure 4. 1 , the limit as this point approaches the node between the genus zero 
components will be the curve on the left. If a fourth marked point is added to the curve 
on the left of Figure 4.2, the limit as this point approaches p% is the curve on the right. 

Setting U = M g ^ n+ \{X, (3), the collection of maps, 

(tt : U - M g} „(X, (3), Pk : M gj „(X, ft U, ev„ +1 : U - X) 

is an example of a family of stable maps. The key property is that for every so G 
M g>n (X, P), we have that [ev|^-i (Jo)) vr- 1 ^), P1O0), • • • , Pn(so)] G M g) „(X, /?). 

Exercise 4.4 Given that sq has trivial automorphism group prove that so is isomorphic 
to the following stable map. 

[ ev l7r-k.v )> 7r ~ 1 ( 5 o),Mso), • • -,Pn(So)] ■ 

This almost implies that U is a 'universal' family of stable maps. What happens if so 
has nontrivial automorphisms? See Section 5.2. 
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We are now ready to give a precise definition of the bundles C k ■ This definition will 
only work as intended when stable maps have no nontrivial automorphisms. To define V 
in general one needs to use the language of stacks. For starters, define the subbundle V 
of vertical vectors on U to be those vectors that are tangent to the fibers of the projection 
7r . Next, recall the definition of the pull-back of a vector bundle. Given a vector bundle 
7r : E — > Y and a map / : X — > Y the pull back is defined by 

f*E := {(x,e) e X x E[f(x) = 7T(e)}. 

Now we have, 
Definition 4.5 Let 

V := ker(vr* : TU -► TM g>n (X, (3)) . 

be the vertical subbundle. Then the tautological bundles over the moduli space are 
C k := p* k V* , where V* is the dual of V. 

We have seen that in the absence of automorphisms the inverse image of a stable map 
so € M g: „(X, (3) under the projection from M gjn+ i(X, (3) to M gjfl (X, (3) is isomorphic to 
the original stable map so ■ The image pk(so) corresponds to the marked point p k in the 
isomorphic copy of so- It follows that the fiber of p\V over so = [u,T,,p] is isomorphic 
to T Pk Yi. Thus the fiber of Ck over a stable map is isomorphic to the cotangent space of 
the underlying curve at the associated marked point. The analogous construction in the 
category of stacks will work when there are automorphisms. 

4.2 Boundary divisors 

We need a way to describe the cohomology of the moduli spaces. By intersection theory 
we can identify a & -dimensional cohomology class with a real codimension k cycle. 
One sees that the set of stable maps with one node has complex codimension 1 , so may 
be used to define real dimension 2 cohomology classes. The classes defined in this way 
are called boundary divisors. 

Definition 4.6 Given two disjoint sets A and B such that AUB = {I, . . . ,n + m} with 
order preserving bijections 7 a : {l,...,n} —> A and jb'. {l,...,m} —> B, non-negative 
integers g and h and second cohomology classes a and (3 in X, we define the boundary 
divisor D(g,A, ot\h, B, (3) to be the push-forward of the fundamental cycle by the map, 

M g>n+l {X, a) x M Km+l {X, (3) -»• M g+h>n+m {X, a + (3); 

([w, £,/?], [u, £',//]) i ► \u U u, S U Pn+1=(?m+1 r], 

where r 7aW :=/?* and r jh{k) := q k . 
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The boundary divisor D(g,A, a\h,B, (3) corresponds to the closure of the subset of 
moduli space of the set of stable maps with one node joining two irreducible components 
having data g,A,a and h,B,(3 respectively. See Figure 4.3. There is an additional 




a + (3 a (3 



Figure 4.3: Degeneration to the boundary divisor D(l, {2}, a\l, {1, 3,4}, j3) 

type of boundary divisor corresponding to the degeneration of a non-separating simple 
closed curve in the domain. This divisor is typically denoted by Ao- 

Exercise 4.7 Give a formal definition of Ao . 

When factors in the definition of a boundary divisor are clear from the context we will 
leave them out of the notation. For example D({1, 3}|{2, 4}) in Mq,4 is short-hand 
notation for D(0, { 1 , 3}, 0|0, {2, 4}, 0), as there is no possible way to have nontrivial 
genus or homology in this case. There is an additional boundary divisor obtained by 
identifying two points on one curve, or equivalently degenerating a non-separating 
simple closed curve. This divisor is typically denoted by Ao- The map combining 
two curves into one nodal curve is very similar to constructions gluing two surfaces 
with boundary along a boundary component. This latter operation is common in the 
analysis of topological quantum field theories. As an example consider the divisors 
D({1,2}|{3,4}) and D({1, 3}|{2,4}) in M 0A . We have identified M 0A with CP 1 
via the cross ratio. Under this identification we are mapping the first point to zero, 
the second to one and the third to infinity and then looking at the coordinates of the 
last point. Thus D({1, 2}|{3, 4}) is identified with infinity and D({1,3}|{2,4}) is 
identified with one, so these divisors are distinct. They are however linearly equivalent, 
which implies that they represent the same cohomology class. The equality of these 
cohomology classes is the starting point for the Witten-Dijkgraaf-Verlinde-Verlinde 
equations (WDVV equations). The WDVV equations are similar to a consequence of 
the fusion rule for monoidal functors, see Figure 4.4. 

This similarity is one good reason to believe that there may be some relationship between 
Gromov-Witten invariants and Chern-Simons invariants. 

The linear equivalence of these two simple divisors is more powerful than it may first 
appear. The projection map from Mq^X, f3) to Mq,4 defined via stabilization whenever 
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Figure 4.4: The WDVV equation and the fusion rule 



n > 4 translates this equivalence into many more moduli spaces. To demonstrate the 
power of this equivalence, the next exercise outlines a proof of Kontsevich's recursion 
for the genus zero Gromov-Witten invariants of CP 2 . The answer to this exercise is 
explained nicely in the book by Hori et al [77]. 

Exercise 4.8 Let N c i be the Gromov-Witten invariant 

(pt, . . • ,pt)o^ [CP i r 

This is the number of degree d parameterized curves that pass through 3d — 1 generic 
points in the plane. Let ir: Mo,3d(CP ,d[CP 1 ]) —* Mo,4 be the standard projection. 
Finally, let Y be a cycle in Mo j3i: ;(CP 2 , cffCP 1 ]) representing 

evttCPV^CPV^pt] • . . ev^[pt]. 

We are denoting cohomology classes by their Poincare duals. Of course one should take 
a generic collection of two hyperplanes and 3d — 2 points when writing a representative 
for Y. 

(1) Express ir*D({ 1 , 2} | {3 , 4}) and ir*D({ 1 , 3} | {2, 4}) in terms of boundary divisors 
onMo^/CCPVtCP 1 ]). 

(2) Show that #(Y n D({1, 2}, 0|{3, . . . , 3d},d[CP 1 ])) = N d . 

(3) Argue that #(F n D{A,e\B,f)) must be zero unless B has the 'right' number 
of points. When B does have the right number of points construct a covering 
projection 

Y n D(A, e\B,f) -> (M , 3e -i(CP 2 , 4CP 1 ]) H evifei) n . . . ev 3 ^i(<? 3 ,-i)) 
x (Mo^j-KCP 2 ,/^ 1 ]) n evi(ri) n . . . ev3/-i(r 3 /-i)) . 

(4) Express #(Y n vr*D({l, 2}|{3, 4})) in terms of the numbers N e . Hint: When 
computing #(Y n D(A, e\B,f)) recall that a degree e curve intersects a degree/ 
curve in ef points when you are enumerating the locations of the node and the 
two marked points that lie on the lines. 
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(5) Express #(Y n tt*D({1, 3}|{2, 4})) in a similar way. 

(6) Use the linear equivalence to deduce a recurrence relation between the various 
Nd, and test your recurrence by computing a few values. You should get N2 = 1 , 
N 3 = 12, and N 4 = 620. 

If these Gromov-Witten invariants are put into a suitable generating function the 
recurrence relation will translate into a differential equation. One important example of 
this allows one to conclude that the descendant Gromov-Witten invariants of a point 
are encoded into the solutions of the KdV equation. This may be considered as one of 
the first tests of Large N Duality for the case of zero-dimensional space. We however, 
will not emphasize this aspect of Large ,/V Duality here. See the paper by Witten on 2D 
gravity [160] for a discussion of this when it was still a conjecture and the paper by 
Kontsevich for a proof [88]. 

4.3 The string and dilaton equations 

We now turn to a description of the recursion relations between the ^-classes. For 
clarity, we will denote the first tj; class on M g>n (X, (3) by %jr[ when we are comparing 
V>-classes on M gt „(X,f3) and M g : „+i(X, f3). We will use a similar notation for the 
tautological bundles. We have by definition, 

To compute this Chern class, we just need to construct a section of the bundle 
of this bundle may be viewed as a vector bundle morphism, s : p[ y (n+1) — 
We can view tangent vectors to the universal bundle as paths of stable maps 
morphism is given by 

s([u, £,/?i, . . . ,p n +i], [u, E,pi, . . . ,p„+i, q(t)]) := 

([w, £,/?i,. . . ,/>„ + i],st[u,E,/>i,. . . ,/>„], st[w, E,/?i, . . . ,p n ,q(t)]). 

Here q(t) is a smooth path in E with lim ; ^o q(t) = p\ , and the above formula is correct 
when t ^ . The underline on the second stabilization refers to the fact that it is the 
result of stabilizing the [u, S,p] and then placing the q(t) at the corresponding point of 
the stabilization. The above formula appears complicated because of the pull-backs in 
the definitions of the two bundles. This bundle map is identically zero over the boundary 
divisor D(0, {1, n + 1}, 0\g, {2, . . . , n}, j3) because the entire bubble containing p\ and 
p n +\ collapses to a point when p n +\ is forgotten. The resulting map is stabilized and 



. A section 

♦ TT*p*V {n) . 

. A natural 
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thus the path q(t) becomes constant. In fact, this is the only way that this section can be 
identically zero over some point. One concludes 

^n+l) _ ^(n) = D(Q) {l n+ {2> jn})/3) _ 

The recursion between cohomology classes that we just derived implies three important 
recursion relations between descendant Gromov-Witten invariants. We will prove two 
of these recursions; the string equation and the dilaton equation. This will require use 
of the Thorn isomorphism and the Euler class of a vector bundle. 

Recall that relative cohomology classes may be represented as the Poincare duals of 
closed cycles. Given a real rank r vector bundle, ir : E — > X over a closed base, there 
is a natural relative cohomology class, $ G H r (E, E — ao(X)) (the Thom class) given as 
the Poincare dual of ao(X). The Thom isomorphism is the map 

H k {X) -> H r+k (E,E - a (X)) 
an$u ir*a. 

The Euler class of the vector bundle is the pull-back of the Thom class by a section, 
e(E) := a j G H r (X). Notice that the Euler class of the real bundle underlying a 
complex line bundle is the first Chern class of the line bundle. It is easier to put all of 
this on a firm theoretical foundation by using cohomology; however the conceptual 
picture is harder to follow. See for example Milnor-Stasheff [111] Spanier [148]. 

Using this technology we will establish two important recursion relations for Gromov- 
Witten invariants. The first is the string equation, 

n 

(T fll (7i) • • • r a „(7„)r (X))^ = ^(r fll (7i) . . . r^-xiik) ■ ■ ■ T an (ln)) X ^p , 

k=i 

and the second is the dilaton equation, 

(r fll (7l) • • • T an (j n )n(X))lp = (2g-2 + n)(r fll (70 . . . r fl „( 7 „))^ . 

We can now prove the string equation (we denote cycles and their Poincare duals by 
the same symbols). To start the computation, we use the definition of the descendant 
Gromov-Witten invariants and the relation between the tp -classes on the moduli of 
(?i+l)-marked curves and the moduli of rc-marked curves. After expanding the powers, 
we recognize that the first integral is trivial because the moduli space of n-marked 
curves is lower dimensional than the moduli space of (n+l)-marked curves. Any 
cycle containing a factor of DiDj for i^j is empty because the set of all stable maps 
with pi and p n +\ isolated in a single bubble is disjoint from the set of all stable maps 
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with just pj and p n+ i in an isolated bubble, so the last integral below is trivial. Using 
evjt o 7r = ev# on the remaining integral gives 

<r ai ( 7 l) • • • T an (ln)T (X))l p = f ^ • • • <"ev l7 l ■ ■ ■ <ln 

J\M„n^(X,8)V k 



L 



i[M g , n+ i(X,P)y 
(vr>i + Di) ai . . . (ir*i>„ + D„) fl "evt 7 i . . . ev* 7 „ 



[M g , n+ i(X,P)y 
[ 7r*« 1 ...<"evhi...ev: 7 n) 

J[M g ,„ +l (XM vk 
n at 



+ E E ? /_ 1 • • • <"^ ev ^ • • • ev ^«) 

k=lp=1 \P J J[M g . n+l (XM m 



+E/ .(w-)- 

We next change the integrand on the remaining integral to N{Dk) because D k is 
supported in this tubular neighborhood. The equation we use in the second step below, 
f E &(3 = f x a*(3, is easy to understand from the view of intersection theory. Any 
variation of a stable map in the boundary divisor 

D k :=D(0,{k,n+l},0\g,{l,...,k,---,n},P) 

may be decomposed into variations that move pk and p n+ \ out of a common bubble 
and variations that leave these two points in a common bubble. The normal bundle of 
Dk consists of those variations that move the points out of a common bubble, that is, the 
vertical bundle restricted to Du . It follows that the Euler class of the normal bundle is 
given by 

e(N(Dk)) := ci(£j) = 

Together with a combinatorial identity from the binomial theorem, this allows us to 
complete the derivation of the string equation. 

= E E ? / ^ W ■ ■ ■ <"V^ev l7l • • • <ln) 

n Ok / \ p 

= E E tj / vir Pl • • • V^evhi . . • evfr*)) 

= E E \ ) - ■ e(N(D k )f~ 1 ^ 1 . . . eVevt7i ■ • • ev^n 
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n <2£ 



SJrJW Am,., 

n ,. 

fc= l J[A4 s ,„(X,/3)]« r 
n 

E( T «i^!) • • • r ^-l(7*) • • • T an (-fn))g,/3 
k=\ 



Example 4.9 We will use the string equation to compute a couple of descendant 
invariants. First consider the integral 



I, 



[X ,6(pt,0)]™ 

This integral is denoted by (r2(pt)ri(pt)(ro(pt)) 4 )Q t . Using the string equation we 
obtain 

(^(ptMCptXToCpt)) 4 )^ = (rKpOnCpOCroCpt)) 3 )^ + (r 2 (pt)(r (pt)) 4 )P; 

= 3(r 1 (pt)(ro(pt)) 3 )P; o = 3((r (pt)) 3 )P; = 3 . 

More generally consider the integral 



L 



[A4o,„(pt,0)]™ 

where a\ + • • • + au = n — 3. Using the string equation this can be reduced to a sum of 
terms with the power of one of the tp decremented. Repeating this, a total of n — 3 
subtractions without combining like terms will lead to a sum of ones. Each term of this 
sum will correspond to selecting a\ of the subtractions, then a-i of the subtractions, etc. 
This shows that the value of the integral is ( a[ "~ 3 aL ) ■ The symmetries in this formula 
arising from rearranging the cij correspond to the geometric operations on the moduli 
space obtained by rearranging the marked points. 



The derivation of the dilaton equation is similar. 

<r fll (7i) • • • T an (7n)n(X))lp = [_ ■ • • CVfe+ievfri • • • <ln 



[M g , n+1 (XM V 

(tt*Vi + £>i) fll • • • (vr>„ + D n ) a "V>«+iev l7l . . . ev,; 7n 



Qeometry & Topology Monographs 8 (2006) 



Introduction to the Gopakumar-Vafa Large N Duality 



237 



/ V«+i7r*(Vf • • • <"evt7i • • • ev*7„) 

n a k 



+ EE H /- • • • <"^ P evt7i • • • ev: 7 «) 

k=l p=1 \P J J [M g ,n+ i(XM vh _ . 

+EL m--.) 

i[yV( Si „ + i(X,/3)]™ 

Two new observations need to be used. The first is that the evaluation of fa+i on the 
fiber of the projection is the evaluation of the first Chern class of the cotangent bundle of 
a surface (on the surface), which is 2g — 2. The second observation is that the pull-back 
of fa+i under the natural map that interchanges and p n+ i is fa. The restriction of 
this map to the divisor is trivial, so these two bundles agree over this divisor. 



= (2g - 2) f 7r*«' . . . <"ev l7 i • • • <ln) 

J[M g ,„(XM vk 

+ E E ? ) /_ A (^ + i^-V*(^ . . . ■ ■ ■ 

= (2g - 2) /_ 7r*(Vf • • • <"ev l7 i • • • <ln) 

J[M g , n (XM™ 
n " k / \ r 

+ E E \) - fae(N(D k )f- 1 ^ . . . <"CeVi7i • • • <7» 

= (2g - 2) / 7r*«' . . . <"evt 7 i • • • ev ; : 7 „) 

J[M g ,„(X,P)Y K 
n a k , v „ 

EE / ^ 1 ...<"evh 1 ...ev: 7n 

~\ \P J J[M g , n (XM viT 



k=\ p=l 



(2g-2 + n)(r fll ( 7 i)...r fln (7„))f 



To conclude this subsection we should remark that there is a different natural family of 
cohomology classes on these moduli spaces that arise as the Chern classes of the Hodge 
bundle. This bundle will appear after we discuss the deformation-obstruction sequence 
in Section 5.6. In addition there is a third important recurrence relation known as the 
divisor equation (see Hori et al [77]). 



5 Local structure of moduli spaces 

We begin this section by pointing out what seems to be a paradox in this theory. When 
the Gromov-Witten invariants of certain integral classes are computed one can get 
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non-integral answers. A simple example can explain the origin of this paradox. This is 
how we begin the following subsection. 

5.1 Orbifolds and M M 

The group of orientation preserving isometries of an octahedron acts on 5 and on TS 2 
in a natural way. The quotient of S 2 by this group action is topologically a new copy 
of S and the degree of the quotient projection is 24. Note that the projection is not 
a covering projection, rather it is a branched cover, branched over the points labeled 
with finite cyclic groups in Figure 5.1. The cyclic group labels are just the stabilizer 
groups of the points in the preimage of each branch point. By keeping track of the local 
symmetry groups, one can develop a theory of branched covers that is very close to the 
theory of covering spaces. This gives rise to the notions of orbifolds, orbibundles and 
their characteristic classes. Orbibundles are not locally trivial over labeled points, but 
rather look (locally) like quotients of such bundles by the label groups. 

Thus an orbifold is locally a quotient of Euclidean space by a finite group where one 
keeps track of the local symmetry groups. Note that orbifolds are not manifolds with 
singularities. Topologically, the points labeled with the finite cyclic groups in Figure 5.1 
are locally homeomorphic to C (although this need not be the case in general), but the 
stabilizer groups make these points special. 

The formal definition is a Deligne-Mumford stack over the category DIFF. This is 
formalized in Appendix A, but we wanted to give an intuitive description first. We work 
out further examples in Section 10 below. 

Continuing with the quotient of S 2 by the octahedral group, note that by the third part 
of exercise in Exercise 4.2, we know that the Chern class of a pull-back bundle is the 
pull-back of the Chern class of the bundle. Since the first Chern class of TS 2 evaluates 
to 2 on the fundamental class of S 2 we would have to conclude that the first Chern class 
of the quotient bundle of TS 2 by the orientation preserving octahedral group would 
have to be ■ Of course there is a map TS 2 / S 2 / ~ ; however, this map does not 
give TS I ~ the structure of a vector bundle. It gives it the structure of an orbibundle. 
Chern classes of orbibundles are defined; however, they are not always integral classes. 

To see that this situation arises in Gromov-Witten theory consider the structure of the 
moduli space M\ \ := M\ i (point, 0). It is a fact that every smooth genus 1 Riemann 
surface is equivalent to the quotient of C by a lattice (see Griffiths-Harris [68]). All of 
these admit a natural group structure, so without loss of generality we may assume that 
the marked point is the equivalence class of zero. Next, notice that any biholomorphism 
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Figure 5.1: The octahedral orbifold 



between a pair of tori must be induced from a linear endomorphism of C. To see this, 
compose a given biholomorphism, (p : T — ► T' , with the projection C — > T and notice 
that the resulting map lifts to a biholomorphism <p: C —> C. Now ((^(z -1 )) -1 has an 
isolated singularity at zero that must be removable as this function is bounded near zero. 
It follows that the map (p extends to a biholomorphism of CP 1 taking infinity to infinity. 
Such must be a linear map when restricted to C. The map z ► — z is an automorphism 
of any torus, thus every point vaM\ \ has stabilizer containing Z2. 

Exercise 5.1 What is the stabilizer of the unique nodal curve in Mi ; i ? 

We may assume that one generator of any lattice used to construct a torus is 1 and that 
the other generator is in the upper half plane by applying a suitable linear transformation. 
Equivalently, the second generator is chosen so that the generators of the lattice form 
a positively oriented basis. It is standard to parameterize a once-marked torus by the 



location of the second lattice point. The choice of second lattice point is however not 
unique. Clearly, adding one to the second generator z >—>■ z + 1 does not change the 
torus. Similarly, interchanging the two generators and changing the sign of one and then 
renormalizing does not change the torus, z > — z~ l ■ These two operations generate an 
action of SL2L on the upper half plane. The quotient of the upper half plane by this 
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action may be identified with M\ t \ ; adding a single point at the cusp representing the 
nodal marked torus gives M^i . A fundamental domain for this action and the quotient 
is displayed in Figure 5.3. 



We can now analyze the group of automorphisms of one of these marked tori. Consider 
the torus labeled by u := e 3 . We have seen that any automorphism must be of the 
form [z] 1— > [az] ■ Since 1 is equivalent to modulo the lattice it must be mapped to 
a new lattice point, say a = a + bu . This implies that ui gets mapped to a lattice 
point, (a + buj)uj = —b + (a + b)uj . This allows us to represent the automorphism as a 
real linear transformation by a 2 x 2 matrix. Since the map is invertible and takes the 
lattice to itself the determinant must be ± 1 . Orientation preserving implies that the 
determinant must be 1 , so 



It follows that (a, b) must be one of the six pairs, ±(1,0), ±(0, 1) or ±(1, —1). These 
correspond exactly to the rotations generated by multiplication by powers of u> . Notice 
that the computation of the stabilizer of uj under the action of SL-{L on the upper half 
plane is exactly the same, thus the upper half plane quotient accurately models the 
orbifold structure on Mi 1 . In particular notice that SL-iL does not act effectively on 
the upper half plane as every point is fixed by — /. This corresponds to the fact that a 
generic elliptic curve has Z2 automorphism group. Thus, PSL-{1> is definitely not the 
right group to study from the point of view of algebraic geometry. 

Exercise 5.2 Compute the automorphism group of the marked torus labeled by i. 

We will now take a detour via exercises to prove that SL2L is generated by two matrices 
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and sketch a proof that (Ti(pt))j l = X. The matrices are 

-1\ (1 1 

1 J ' \0 1 

After the detour, we will give the correct definition of the moduli stack. This encodes 
the orbifold structure of the moduli space. 

Let PSLjL be the quotient of SL{L by ±1 . This acts on the upper half plane by linear 
fractional transformations. One can define an orbifold fundamental group. This group 
should satisfy a van Kampen theorem and the orbifold fundamental group of an X/F 
should be F when X is a simply connected orbifold and the action is nice (see Ratcliffe 
[127]). 

Exercise 5.3 Conclude that nf rb (M/PSL 2 Z) = PSL 2 Z. Now use the orbifold van 
Kampen theorem and Figure 5.3 to prove that this is ([5], [VK]|[5] 2 = 1, [W] 3 = 1), 
where W = ST and [A] is the image of A G SL4L under the natural projection. 

Exercise 5.4 Check that the map from the group (s, t\s 2 = (st) ,s = 1) to SL 2 r L 
taking s to S and t to T is a well-defined group homomorphism. Show that the 
kernel of the composition of this map with projection to PSL 2 7L is 7L 2 . Conclude that 

SL 2 Z °* (s,t\s 2 = (st) 3 ,s* = 1). 

Exercise 5.5 Give a definition of a line bundle in the orbifold category. Also define 
the first Chern class of an orbifold bundle. You should see that the Chern class may be 
computed by counting zeros with sign, but one must divide each term by the order of 
the stabilizer group. Also prove that this Chern class satisfies the usual multiplication 
by degree for pull-backs. 

Now (ri(pt)) j' is just the evaluation of the first Chern class of the tautological bundle 
over M^i ■ To do this we would like to have a section of C\ . Such a section would 
associate a holomorphic form to each marked torus. It is natural to try a section 
that would associate f(r)dz to C/(l,r). Transformations by elements of Sh^L fix 
the torus and so should fix the holomorphic form. Multiplication by cr + d is an 
isomorphism C/(l,^±|) -> C/(cr + d, ar + b) = C/(l,r). It follows that we need 
fi cT+d ) = ( CT ^)/( r ) m or der for this section to be well defined. 

Definition 5.6 The Dedekind eta function is the analytic function on the upper half 
plane given by 

00 

IT IT 



n=\ 
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The following functional equation is proved in Apostol [12]. 

7 / ar + b\ 9 9 

T r-, = -ie(a,b,c,dY(cT + d)r] z (T) , 

\ct + d J 

where 

f / a + d 

e(a, b, c, d) := exp < m — h s(— a, c) 

I V u " 

and 

Er ( hr 

r=l v 

Thus r i — > i] 2 (T)dz is almost a holomorphic section of the tautological bundle C\ . In 
fact, (— ie(a, b, c, d) 2 ) 12 = 1 so r i— > rj 2 (r)dz is a well-defined section on the pull-back 
of C\ to a 12-fold branched cover of Af^i . 

Exercise 5.7 Label translates of the fundamental domain in Figure 5.3 by the group 
elements in PSL2L used to translate them. Now compute —ie([A]) 2 for each group 
element and use this as a label. Pick one translate with each of the 12 different labels. 
The collection of these translates is a fundamental domain for the 12 -fold branched 
cover of M11 . Also label the adjoining translates to figure out the identifications on the 
larger fundamental domain. 

Exercise 5.8 Show that tubular neighborhoods of the cusps in each of these 12 labeled 
translates glue together to a once-punctured disk, and the function r/ 2 (r) extends across 
this disk with a simple zero at the center. (Use the group label to translate back to the 
small fundamental domain without losing the — z'e([A]) 2 factor and introduce w = e 2n,T 
as a coordinate.) 

Adding the extra point described in Exercise 5.8 gives the 12 fold cover together with a 
section of the pull-back of L\ to this cover. The negative of the identity matrix acts 
trivially on every point of this cover. This means that the stabilizer group of every point 
in the cover is Z2 so the evaluation of the first Chern class of the pull-back bundle is ^ 
of the number of zeros which is just J . Thus since Chern classes are multiplicative 
under covers (Ti(pt))j = ^ as claimed concluding our detour. 

5.2 Moduli stacks 

The proper structure to encode the orbifold idea in algebraic geometry is a stack. One 
does not have to understand the definition of a stack to get a feel for Gromov-Witten 
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invariants so we do not include the definition here. However most sources do not even 
define a stack, so we have given the definition together with some motivating examples 
in Appendix A. The main points to keep in mind are that a stack adds extra structure 
that remembers the stabilizer groups, and that the 'universal' family is a stack. 

A family of stable curves {Ey} y6 y is encoded as a map say / from one space X to a 
second space Y such that for every point y G Y, f~ l (y) can be thought of as a stable 
curve. One thinks of this as the set of stable maps/ _1 (y) together with some topology 
linking everything together. 

To define families of stable maps, we need a technical definition. 

Definition 5.9 An R -module M is flat if — ► M ®rA — > M ®rB — ► M ®rC — > 
is exact whenever 0—>A—>B—>C— > is exact. Given a morphism between analytic 
spaces, / : X — > Y the ring of germs of analytic functions over a point xq G X (denoted 
by Ox(xo)) is a module over the corresponding ring of germs on Y, Oy(f(xo)). The 
map / is called flat if Ox(xq) is a flat Oyifixo)) -module for every xq G X. 

Intuitively a map / is flat if it has a nice fiber structure. See Hartshorne [75] for a 
discussion and examples. 

Definition 5.10 A family of stable maps is a flat morphism 7r : V — > S together with 
maps, u '. V — > X, and p^: S —* V such that / o p k = ids an d 

["L-^o)' 7r ~ 1 ( 5 o) ) PiOo), • • • , PnOo)] S M gt „(X, (3) 

for every so G 5. A morphism of families of stable maps is a pair of maps, G : V — ► V' 
and ^ : S —> S' that intertwine the structure maps, u, u! , pk and p' k . 

A universal family of stable curves is one such that there is a unique morphism of families 
from any given family into the universal one. If one sticks to families over schemes 
or analytic spaces, there is no universal family; see the discussion in Mumford [114], 
Harris-Morrison [74] or Section 10. The moduli stack is just a formal construction of 
a universal family. The resulting construction generalizes the category of schemes or 
analytic spaces. Given a space T one constructs the contravariant functor T that takes a 
scheme S to the set of all morphisms from S to T. See Appendix A or Section 10 for 
more information. 

Definition 5.11 The moduli stack, A4 gtn (X, (3) is the functor from the category of 
schemes (or analytic spaces) to the category of sets taking a scheme S to the set of 
equivalence classes of families of genus g, ra-marked stable curves representing in 
H2(X). (Recall the definition of a family of stable maps from Definition 5.10). 



Qeometry & Topology Monographs 8 (2006) 



244 



Dave Auckly and Sergiy Koshkin 



The definition of the moduli stack is very elegant and nicely avoids questions about the 
topology of moduli space. However, one still has to work to extend intersection theory 
to stacks; see Vistoli [155]. 

5.3 Deformation complexes 

Finally, we come to the main point of this section - the local structure of the moduli 
space. It is a general principle that nondegenerate spaces or maps have the same local 
structure as their linear approximations. The implicit function theorem is one version 
of this principle. We are interested in a generalization to spaces with group actions; 
see Bredon [33], Audin [17] and Atiyah-Bott [15]. It will be convenient to consider 
right G-spaces, ie assume that the group G acts on the right in contrast to the standard 
convention. 

As a warm-up we shall study the following situation. Let F : X — > Y be a smooth 
equivariant map of right G-spaces and let yo := F(xq) be a G fixed point of Y. To 
get a local model of F~ 1 (yo)/G in a neighborhood of [xo] we linearize the following 
sequence of maps, 

G^X^Y, 
where L XQ (g) := XQg. The linearization is 

T^m TF 

T\G — ° T XQ X — — > T yo Y . 

Exercise 5.12 Prove that the above sequence is a complex, that is, TF o TL XQ = 
given that yo is a fixed point. It is called the deformation complex. 

We will call the stabilizer group of xo the automorphism group of xq, Aut(jco) := 
{g G G\xog = xo}. The kernel of TL XQ is the zeroth cohomology of the deformation 
complex and it is isomorphic to the Lie algebra of the automorphism group, aut(xo) . 
Provided TF is surjective, F~ 1 (yo) will be a manifold. The cokernel of TF measures 
the failure of TF to be surjective. This cokernel is the second cohomology of the 
deformation complex and is called the obstruction space, ob(xo). Assuming that the 
obstruction space vanishes, F~ l (yo) is a manifold locally homeomorphic to the kernel 
of TF. The quotient of this manifold by the G action can be locally identified with 
the first cohomology of the deformation complex provided that the automorphism 
group is trivial. This cohomology group is called the deformation space of xq and is 
denoted by 9ef(xo). The point is that the exponential map applied to the orthogonal 
complement of the image of TL XQ in the kernel of TF is a local slice for the action 
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of G on F (yo)- In other words, it intersects each nearby G orbit in exactly one 
point. The obstruction, deformation and automorphism spaces glue together to form 
the obstruction, deformation and automorphism bundles over the space F~ l (yo)/G. 
If the obstruction space is trivial at a point, but the automorphism group is nontrivial, 
then there is a natural action of the automorphism group on the deformation space. 
Furthermore, F~ 1 (yo)/G is locally homeomorphic to the quotient of the deformation 
space by the action of the automorphism group. If the obstruction space is nontrivial, 
there is a map from the deformation space to the obstruction space and the quotient 
F~ (yo)/G is locally homeomorphic to the quotient of the inverse image of zero under 
this map by the automorphism group. This special map is called a Kuranishi map. The 
Kuranishi map is described in the following exercise for the case of trivial automorphism 
group. 

Exercise 5.13 Let F : H l © V — > H 2 © W be a smooth (non-linear) map between 
linear spaces satisfying F(0) = 0, H Y = ker(7oF), and H 2 = coker(ToF). Define a 
map $ : H l © H 2 © V H l © H 2 © W given by 

®(x,y,z) = (x,y + Fi(x,z),F 2 (x,z)). 

Use the inverse function theorem to prove that there is a locally defined inverse and 
smooth maps tp: H l —> H 2 and £ : H l — ► V defined on open neighborhoods of zero 
in H l such that <&{x, tp(x), = 0. Conclude that ip~ (0) is locally homeomorphic 
to F~ l (0). The map tp is the Kuranishi map. 

Exercise 5.14 Apply these ideas to various orbit types in the quotient of S 3 (viewed 
as the unit sphere in C 2 ) by the natural action of S 1 x S l . 

5.4 Deformations of stable maps 

We will now apply these ideas to the moduli space of stable maps. A stable map is 
specified by a complex structure on a surface, a collection of marked points and a 
/-holomorphic map into a symplectic manifold. Recall the relevant definitions from 
Section 3. We will separate the deformations of a stable curve into deformations of the 
marked points and complex structure and deformations of the map with a fixed set of 
marked points and fixed complex structure. 

Begin by considering deformations of the map with the marked points and complex 
structure fixed. By definition, a map u: S — > X is /-holomorphic if Bu = 0. The 
expression Bu may be applied to a tangent vector in T XQ T, to produce a tangent vector 
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in T U ( XQ )X. This may be reinterpreted to say that Bu G ^(A'E <g) u*TX), ie Bu is a 
1-form with values in the pullback of the tangent bundle. The expression Bu extends 
to a map from the complexified tangent space of X to the complexified tangent space 
of X. It is completely determined by an induced map from A 0,1 S to u*TX. Thus we 
usually view Bu G C 00 ^ ' 1 !] ®u*TX). 

Exercise 5.15 Let V be a real vector space with almost complex structure /, and 
let V c := V (£>r C be the complexification. The (anti)holomorphic subspace is the 
(— )/-eigenspace of J c := J ® 1 acting on V c . These are denoted by (V ' 1 ) or V 1 ' . 

(1) Show that the projection P 1 ' : V c -> V 1 ' given by 

P(X) = \{X - J ® iX) 

restricts to an isomorphism of complex vector spaces from V to V 1,0 when V is 
viewed as a complex vector space via J and is viewed as a subspace of V via 
V ® 1 . (For this reason we often identify the holomorphic subspace of V c with 
Vby P 1 ' .) 

(2) Let Bu act on the complexified tangent space of £ in the natural way. Show 
that it takes A 1,0 1! := PI! 1,0 to u*TX Q > 1 (it takes holomorphic vectors to 
antiholomorphic vectors.) 

(3) Conclude that Bu is uniquely determined by the restriction of P i,0 Bu to A 0,1 S. 

Now consider a one-parameter family of maps passing through u, say v t . The derivative 
of this family (at t = 0) associates a tangent vector in T U ( XQ )X to a point xq G S, so 
we may view vo G C 00 (m*PX). The linearization of B maps vq to the derivative of Bv t 
at ? = 0. We will abuse notation and denote this by Bvq. This may be viewed as an 
element of C oo (A 0,1 S (8) u*TX) as explained in the previous exercise. An expression 
for the linearization of B may be found in Salamon [134], McDuff-Salamon [109] and 
Audin-Lafontaine [18]. We represent it by 

(2) 8: C°°(u*TX) -» C 00 ^ ' 1 ^ ® u*TX). 

The kernel of this map is the deformation space of maps with fixed complex structure 
and marked points. It is denoted by Def(w). The space of stable maps with underlying 
marked curve [£,/?] is locally diffeomorphic to Uef(w) at [u, T,,p] provided that 
the obstruction space (cokernel of the above map) ob(u) vanishes. There are no 
automorphisms that act on maps with fixed marked domain. 

We now consider automorphisms and deformations of the underlying marked curve. 
An automorphism is just a holomorphic map ip : S — > S fixing the marked points, so 
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dip = and ip(pk) = Pk • Given a one-parameter family of maps ip t , we can differentiate 
the defining conditions of the automorphism group to obtain the conditions specifying 
infinitesimal automorphisms: dtpo = and ipo\ Pk = 0. The nice way to encode these 
holomorphic vector fields that vanish at the marked points is to use the kernel of the 
following operator, 

(3) 8: C 3 °(TE®[-pJ)->C x> (A°' 1 E®TE®[-p]). 

Here —p = —p\ — •••—/?„ is negative the divisor associated to the marked points and 
[—p] is the complex line bundle associated with this divisor; see Griffiths-Harris [68]. 
One may pick a (unique up to scale) non-zero meromorphic section of this line bundle 
with a simple pole at each marked point, say s~ . The map ipo h-> ip ® s~ identifies the 
infinitesimal automorphisms aut([S,p]) with the kernel of the above map. 

Since a complex structure satisfies J 2 = —I, the derivative of a one-parameter family 
of complex structures satisfies JqJ + J Jo = 0. For a holomorphic vector X we have 

J(JJ X) = -J(J JX) = -iJJ X . 

Thus \JJq takes holomorphic vectors to antiholomorphic vectors and may be interpreted 
as an element of A 0,1 5] ® TT, similar to the way that Bi>o may be interpreted as an 
element of A 0,1 S ® u*TX. A one-parameter family of deformations J t is trivial if there 
is a one-parameter family of reparametrizations tp t so that dtp t o / = J t o d<p t ■ 

Exercise 5.16 Show that that dip t oJ = J t o dip t implies that jJJq = dcpo . 

One also has to consider deformations of the marked points. Surprisingly, deformations 
of the marked points may be modeled by deformations of the complex structure that are 
fixed at the marked points. This is easiest to understand on the Riemann sphere CP 1 
because CP 1 has exactly one complex structure up to reparametrizations. 

Consider a collection of four or more points on CP 1 , say p = (p\, . . . ,p n )- Since we 
know that there is a unique complex structure on CP 1 up to reparametrizations on CP 1 , 
given any family of complex structures J t we may find a family of reparametrizations ip t 
so that dipt oJ = J t o dip t . The expression ip t (p) describes the associated one-parameter 
family of marked points. This generalizes to deformations of the complex structure 
and marked points on any marked curve. To summarize, each deformation of the 
equivalence class of a marked curve is uniquely specified by an element ( jJJo) of 
the cokernel of the map in equation (3). The obstruction space vanishes for dimensional 
reasons. 

One may think that it is impossible to have nontrivial automorphisms and nontrivial 
deformations of the same marked Riemann surface. This is true for smooth surfaces, 
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but it is not true for nodal curves. The marked curve in Figure 5.4 has a four-complex- 
dimensional family of automorphisms corresponding to linear reparametrizations of each 
side bubble. In addition it has a seven-complex-dimensional family of deformations, 
three for the marked points in the center bubble, two more for the locations of the nodes 
and two more for the resolutions of the side bubbles. 



Figure 5.4: A nodal curve with automorphisms and deformations 

We can now assemble the deformation complexes of a map (equation (2)) and marked 
curve (equation (3)) to obtain the deformation complex of a stable map [it, S,p]. We 
first form the double complex 

C°°(u*TX) ^ C^CA 0,1 !! <g> u*TX) 

A 



(4) du(-®s+) 



du{— 



C°°(TT, <g> [-/?]) - — ^C oo (A > l Y,®TY,(g)[-p]) 

d 

Here s + is a non-zero section of [p] with a simple zero at each marked point. A bit of 
thought allows one to conclude that the associated total complex, 

c°°(rs <g> [-p]) -» c 00 ^ ' 1 ^ ®rs® [-/?]) e c°°(u*tx) -> c 00 ^ ^ <g> u*tx) , 

is the deformation complex of the stable map [u, £,/?]. The kernel of the first 
map {{B,du(— ® s + ))) encodes the conditions that an infinitesimal automorphism 
corresponds to a one-parameter family of holomorphic maps tp t fixing the marked 
points and satisfying u o ip Q = u. The kernel of the second map (3 — du(— (8> s + )) 
encodes the fact that each map in a deformation of a stable map must be /-holomorphic 
with respect to the corresponding structure (J o du t = du t o 7 f ). 

5.5 Homological description of deformations 

Introducing a bit of homological algebra will allow us to formulate the deformation 
complex for nodal curves, where the notion of C°° sections may be less clear, and 
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will provide additional computational tools for the study of the local structure of the 
moduli space. The homological definition of the deformation complex will also have 
the added benefit of defining bundles of infinitesimal automorphisms, deformations and 
obstructions over the moduli space as opposed to just defining vector spaces attached at 
each point. The book by Weibel [156] is a good reference for the homological algebra 
that we use. 

Definition 5.17 An R -module / is called injective if the functor Hom#(— , /) is exact 
(that is, takes exact sequences to exact sequences). An injective resolution of an 
R -module M is an exact sequence, 

- M - 7 - h - • • • , 
where the Ik are injective. Given a complex of R -modules, 

A* := ► Afc — ► Ak+i —>•••• 

and an R -module B, one may take an injective resolution of B say — > B — > I* , and 
form the double complex Hom#(A,-, Ij). The hyperext group Ed^(A*,-B) is by definition 
the £th cohomology of the associated double complex. 

Notice that we have isomorphisms 

C°°(u*TX) ^Homo E ( M *O x ,C oo (A S)) 
C°°(A°' l E ® u*TX) ^Homo IJ (B*ny,C oo (A ' 1 S)) 
C°°(rS ® [-p]) ^Hom OE (ft s ([^]), C°°(A°S)) 
C^CA ' 1 ^ ® TS ® [-p]) ^Hom OlJ (nE([p]),C lx> (A ' 1 E)). 

Here is the sheaf of holomorphic functions on S, Qs(L) is the sheaf of holomorphic 
differentials on S taking values in a line bundle L and Qx is the sheaf of holomorphic 
differentials on X. Any sheaf £ may be realized as the sections of an associated sheaf 
space E — ► X. By definition, the pull-back of a sheaf under a map /: Y — > X is 
/*£ := T(f*£). 

Exercise 5.18 The third isomorphism listed above is defined by 

F(X (8) (8) := a(X>f . 
Find the other three isomorphisms. 

We can now see how to represent the infinitesimal automorphisms, deformations and 
obstructions as hyperext groups. Notice that a partition of unity argument may be 
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used to show that a sheaf of C°° sections is fine. This in turn implies that it is 
Hom0 E ((iw, — )-acyclic thus the hyperext groups can be computed with these sheaves; 
see Hartshorne [75]. Also recall that the definition of an exact sequence of sheaves 
is defined by the requirement that the sequence be exact at the level of germs. This 
means that the usual de Rham complex is an exact sequence of sheaves (even though the 
sequence of global sections fails to be exact as measured by the de Rham cohomology). 
The point is that a closed form becomes exact when it is restricted to a small enough 
open set. It follows that the following is an acyclic resolution of 0£ : 

-»■ O s -»■ C°°(A S) -► C 00 ^ '^) -► 0. 

We will use this resolution and the definition of the hyperext groups to construct the 
deformation-obstruction complex. 

5.6 The deformation-obstruction sequence 

Using the four isomorphisms and the definition of the hyperext groups we arrive at the 
following definitions which generalize our proceeding discussion for smooth stable 
maps. 

Definition 5.19 The spaces in the deformation-obstruction sequence of a stable map 
are given by: 

Oef(w) :=Ext Os (u*n x ,O s ) 

cmt([£,/>]) :=Ext^ s (O s (W),C» s ) 
0ef([S,/7]) :=Extk s (Q s (M),e> s ) 
aut([«, E,/>]) :=Ext Os (u*n x Q s (&?]), O s ) 
Vtfflu, E,p]) :=Ext^,( B *n z Q S (M), O s ) 
ob([n, E,p]) :=Ext 2 0s (u*n x C» s ) . 

Homological algebra will lead to a long exact sequence relating these groups. The 
following aside sketches the derivation of the exact sequence. 

Aside 5.20 Since the hyperext groups are defined as the cohomology of the total complex 
of a double complex and every double complex has an associated spectral sequence, there is a 
spectral sequence related to the hyperext groups. As the double complex only has two terms 
in any direction, the spectral sequence will collapse at the £2 term and produce a long exact 
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sequence. See Bott and Tu [30], Brown [34] and Spanier [148] for a detailed discussion of 
spectral sequences. Briefly, to any double complex C p,q one associates a total complex 

TC := ®k+e=nC k ' e 

filtered by 

F P TC" := (Sk+e=n,k> P C k ' e . 
The zeroth page of the spectral sequence is defined by 

E p,q •= 

Similarly the last page of the spectral sequence is defined by 

F p H p+q (TC*) 



E P J> 



FP+ l H p +i(TC*)' 



Exercise 5.21 Prove that for the spectral sequence of a double complex one has E p ^ q = C p q . 

The differentials of the double complex may be used to define differentials on the pages of 
the spectral sequence, d n : E p ' q — > E% +n ' q+ ~ n . Here the pages may be inductively defined 
by E„ + \ := H(E„,d„). Applying this to the double complex of equation (4) we see that the 
£i-page of the associated spectral sequence is given by taking the cohomology in the vertical 
direction. Combined with the definition of the deformation spaces this gives: 

= M([£,/>]) ob(u) 

1 aut([S,p]) 0ef(ii). 

Taking cohomology in the horizontal direction gives: 

kerOef([S,p]) -» ob(u)) coker(9ef([S,p]) -> ob(u)) 
2 ker(cuit([E,/j]) -> Def(n)) coker(aut( [£,/?]) Uf(u)) . 

Since all groups outside of this square are zero all other differentials are zero and we conclude 
that Eoq = Ei . Using the definition of the E^ page and the definition of the deformation 
groups we conclude that 

^ 1 = F°gO(rC*) ' E ^ = F ° Hl ( TC *) and F H\TC*) = *tf([u,X,pJ). 
Combining this with the above computation of the E^ page gives the following exact sequence: 
-> coker(aut([S,p]) -> Uf(u)) ker(5ef([E,p]) -> ob(u)) -» . 
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Exercise 5.22 Continue in this way to prove that the following deformation-obstruction 
sequence is exact. 

The deformation-obstruction sequence is 

(5) - ouf([ M , £,/?]) - aum,p]) 

Ocf( M ) dtf([u, £,p]) -► ?)ef([£,p]) 

-> ob(w) ob([w, £,p]) 

It is worth pointing out what the maps in the deformation-obstruction sequence are. 
Any automorphism of [u, is an automorphism of [£,p] , so the first map is just the 
inclusion. The second map is given by ipo i— > j t u o (£>| ;= o. The third map is v i — > (v, 0); 
the fourth is (v,B) B; the fifth is 8 ^ du(B s + ). The sixth map is just the 
projection because both ob(u) and ob([u, £,p]) are quotients of the same group, but 
the latter is a quotient by a larger equivalence. A few more comments will clarify these 
notions. 

First consider the infinitesimal automorphisms and deformations of a smooth marked 
curve. We have the following acyclic resolution of Os(rS ® [—/?]): 

o -> S (T£ ® [-/?]) -► c°°(rs <8) [-p]) -> c^ca ' 1 ^ <g> rs ® [-p]) 0. 

It follows that we may make the identifications 

aut([S,p]) *fl°(E, O s (rS ® [-p])) 
0ef([£,p]) ^(S, O s (rS [-p])) . 
The simplest version of the Riemann-Roch theorem (see Forster [58]) states that 
dim c //°(S;L) - dimc^E;!,) = ci(L)[£] + l-g. 

This gives 

dim c aut([E,p]) - dim c fef([£,/?]) 

= dime O s (rS <g> [-/>])) - dim c /^(E; O s (rE [-p])) 
= deg(rE ® [-p]) + l- g = 3-3g-B. 

This is in fact true for the deformations and infinitesimal automorphisms of any marked 
curve. For the special case of £ = CP 1 the bundle TCP 1 (g> [— p] is just the degree 2 — n 
bundle over the sphere. The Kodaira vanishing theorem (see Griffiths-Harris [68]) 
states that H q (X; Q P (L)) = for p + q > n when L has positive degree. With 3 or fewer 
marked points the Kodaira vanishing theorem implies that H l (T,, O^iTT, (g) [— p])) = 
and we conclude that dime <*ut([£,p]) = 3 — n and dime 5 ef([£,p]) = 0- O ne may 
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also directly compute H as homogenous degree 2 — n polynomials in this case. One 
sees that this matches perfectly with linear fractional transformations fixing 3 or fewer 
points. 

With three or fewer marked points H l is trivial and H° can be nontrivial. We will see 
that the situation with 4 or more marked points is just the opposite-// is trivial and 
H l can be nontrivial. To be more precise, recall that for smooth varieties there is a 
non-degenerate pairing, 

H°' k (X, 0{E)) x H°'"- k (X, 0{E* ® A n '°X)) -> C , 

given by (a, j3) i— > f x a A /3. Here the E and E* components pair to give a number 
and we have k components dz.\ from the first term, n — k from the second term and n 
components dn from the coefficients in the second term. This pairing gives the duality, 

H°' k (X, A) = (H°'"- k (X, n x <g> A))* , 

known as Kodaira-Serre duality [68]. Here Qx is the sheaf of top-dimensional 
holomorphic forms. (The bundle A n,0 X is called the canonical bundle and is denoted 
by Kx- Applying this duality to the deformations gives, 

Oef([£,/>]) ^ H°(E, ft E ® H E ([p]))* . 

The elements of this last group take the form f(z) dz (£> dz in local coordinates and are 
called quadratic differentials. 

Kodaira-Serre duality generalizes to more general projective varieties where there is a 
dualizing sheaf, denoted by u>x , so that 

E°'\X, A) (H°> n - k (X, tux 8> A))* ■ 

See Hartshorne [75]. The dualizing sheaf leads to the last family of cohomology classes 
that we will need. One can construct a bundle over the moduli space, such that the fiber 
over any point [u, £,/?] is just the space of sections of the dualizing sheaf ws). 
This bundle is called the Hodge bundle and it is denoted by E. The Chern classes of 
the Hodge bundle are called Hodge classes. The formal definition of the dualizing sheaf 
is as follows (see Hori et al [77]): 

Definition 5.23 The dualizing sheaf over a nodal curve E denoted by ws is the sheaf 
of meromoiphic differentials that: 

(1) are holomorphic away from the nodes 

(2) have at worst a pole of order one at each node branch 

(3) have residues that sum to zero on each pair of node branches. 
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Exercise 5.24 Give a formal definition of the Hodge bundle analogous to the definition 
of the tautological bundles Cu . 

According to the previous application of the Riemann-Roch theorem, for every extra 
marked point the difference between the dimensions of the automorphisms and defor- 
mations decreases by one. Returning to the example in Figure 5.4, we see that adding 
one point to the left bubble would reduce the dimension of the automorphism group 
by one because the restriction of the automorphism to the left bubble would have to 
fix two points, not just one as before. However, this would not change the space of 
deformations. One might think that a possible position of the new marked point needs 
to be taken into account. But any motion of the new point is a trivial deformation. 
Indeed, an automorphism changing its position can can be applied to the space before 
adding the point. A similar thing occurs with the addition of a second marked point to 
the left bubble. When one adds a third marked point to the left bubble, the dimension 
of the group of automorphisms will not change from the dimension with two marked 
points because there are no automorphisms acting nontrivially on the left bubble (such 
an automorphism would have to fix at least three points). There would be an extra 
deformation corresponding to changing the location of the third point. 

We now have a fairly good local description of the moduli stack. Since the moduli 
space models stable curves there are no infinitesimal automorphisms. Assuming that 
the obstruction space vanishes, we see that the moduli space is locally isomorphic to the 
quotient of Oef([«, £,/?]) by a finite group. The deformation-obstruction sequence can be 
combined with the Riemann-Roch theorem to compute the dimension of d ef([w, X,/?]). 
This is called the virtual dimension of the moduli space. 

Exercise 5.25 Recall that the alternating sum of the dimensions of an exact sequence 
of spaces is zero. Use the Riemann-Roch theorem 

Y^(-l) k dim c H k (X,E) = Td(7X)ch(£)[X] 

k=0 

to compute dime Ocf(«) — dime ob(u). Combine this with our earlier computation of 

dim c aut([E,p]) - dim c Oef([S,p]) 
and the deformation-obstruction sequence to conclude 

virdim c M g>n (X, ft = f Cl (TX) + (dim c X - 3)(1 -g) + n. 
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6 Localization 

We have come a long way in our review of Gromov-Witten invariants. We described all 
the technical elements in their definition with the exception of the virtual fundamental 
class (addressed in a later subsection). We also performed a number of non-trivial 
computations via recursion or direct reasoning. In this section we will describe a new 
computational tool called localization used to compute the Gromov-Witten invariants 
of the resolved conifold X S 3 . We start with a general description of localization and an 
outline of the virtual localization formula for Gromov-Witten invariants. Some readers 
may prefer to skip down to the sample computations that we give for CP 2 after the 
general discussion. 



6.1 The Umkehrung 



Localization is a technique reducing a computation of an integral over a higher- 
dimensional space to an integral over a lower-dimensional space. Of course this is 
impossible in general, but it is instructive to try. 

Given an inclusion (or any map) ip\ F M, we have the well-known pull-back 
i%: H k {M) -> H k (F), given on the level of Poincare duals by the inverse image. There 
is a less well-known push-forward ip\ : H k {F) — » H k+m ~J (M) defined for any map 
between oriented manifolds. This map (called the Umkehrung) is defined by the 
following diagram. 

H k (F) >H k+m -f(M) 



PD 



PD~ 



Hf- k (F) 



-nr. — >H f -m 

Here m is the dimension of M, f is the dimension of F and PD is Poincare duality. 
We will apply these ideas to orbifolds. For orbifolds everything goes through as in the 
manifold case provided one uses rational coefficients. 

There are nice descriptions of the Umkehrung for fibrations and embeddings. As an 
example, if -km '■ M — ► pt, the map ttmi '■ H m (M) — > H°(pt) is just given by integration 
f\a = f M a. When ir : E — ► M is a fiber bundle, the Umkehrung is just integration 
over the fiber (see Bott and Tu [30]). 

In the case of an embedding Lp\ F M, we can compute ip\ 1 and get an interesting 
answer. The cycle dual to 1 in F is just F and so the cycle dual to lf\1 is just the image 
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of F in M which is the zero section of the normal bundle to F in M. Recall that the 
image of the zero section of a bundle is dual to the Thorn class . The Euler class of the 
bundle is the pull-back under any section of the Thom class [30]. It follows that 

L* F L F d = e(N(F)) . 

Remark 6.1 The Umkehrung for an arbitrary map can be decomposed into one for 
an embedding and one for a projection. Stalling with a cycle in F dual to the given 
cohomology class one obtains a cycle in F x M by the natural inclusion F <— » F x M 
taking x to (x, Lf(x)) . Given this cycle in F x M one obtains a cycle in M by composition 
with the projection. The class dual to this final cycle is the value of the Umkehrung. 

If one could invert the Euler class, one might hope that the maps ^mpyj '■ H*(M) — ► 
H*(F) and ip\ : H*(F) — ► H*(M) would be inverses, thus reducing integrals over M 
to integrals over F . Of course this is too much to expect in general. Unfortunately, 
cohomology classes of positive degree on a finite-dimensional manifold are all nilpotent 
(since high enough powers would be forms of degree larger than the dimension of the 
manifold) and therefore not invertible. 

6.2 Equivariant cohomology 

It is much more reasonable to expect such a reduction to work if everything is invariant 
under a group action because as we will see, equivariant cohomology has more invertible 
elements. The insight of Atiyah and Bott [15] was that this could be made to work when 
F is the fixed point locus of a torus action on M. The geometric intuition behind the 
reduction of an integral over a larger set to an integral over a smaller set for equivariant 
functions is that the symmetry allows one to sample their values at a smaller collection 
of points. 

We now need a brief review of equivariant cohomology. If G is any Lie group and 
EG is a contractible, free, right, proper G-space and M is any left G-space one can 
form the twisted product EG XqM : = EG x Mj ~ where (eg~ 1 , gm) ~ (e,m). The 
equivariant cohomology of M is defined to be 

H* G {M) :=H*{EGx G M). 

As an example take G = T 2 , then EG = S°° x 5°° where S°° is viewed as 
the unit sphere is an infinite-dimensional complex space and the action is given by 
(xo,x\) ■ (Ao, Ai) := {xq\q,x\\\). The equivariant cohomology of a point is computed 
as follows [15], 

H* T2 (pt;Q) = H*(CP°° x CP°°;Q) Q[a ,ai] . 



Qeometry & Topology Monographs 8 (2006) 



Introduction to the Gopakumar-Vafa Large N Duality 



257 



Notice that CP 00 is infinite dimensional and a\ have degree 2 . Given a representation 
\i\ G — > GL n C we get a vector bundle £G Xc C" ^ £G pt, and associated 
equivariant classes given by the Chern classes of this bundle Ck(EG Xg C n ). If 
: J" — ► GLiC is the projection to the &th factor and |u| is the dual representation 
then the first Chern class of the line bundle associated to is the standard generator au 
of H* 2 (pt; Q). This corresponds to the fact that the first Chern class of the tautological 
line bundle over projective space evaluates to — 1 on a standardly oriented generator of 
the second homology. In fact homogeneous polynomials of degree d may be regarded 
as sections of the degree d (as measured by the first Chern class) line bundle over 
projective space and this is where all of the sign conventions come from. 

A second example of equivariant cohomology is the equivariant cohomology of the 
group with the natural left action. The result is 

H* G {G) := H*(EG x G G) = H*(EG) = <Q> . 

Thus when the action was free the equivariant cohomology was trivial and when the 
action was trivial the equivariant cohomology was interesting. In some sense equivariant 
cohomology is generated by the fixed point set of the action. 

From here on forward we will work with torus actions only and T will always denote a 
torus. In order to make the observation that the equivariant cohomology is generated by 
the fixed point set more precise and follow the reduction outline from the beginning of 
this section we need to invert elements of the equivariant cohomology and ultimately 
invert the Euler class of the normal bundle to the fixed point locus. Notice that 
#7«+i(pt) = Q[ao, • • • ,« n ] is an integral domain. This is in stark contrast to the 
cohomology of finite dimensional manifolds. We let 

Fjn+l = Q(«0, ■ ■ •,£*„) 

be the associated fraction field. The obvious map EG XgM^ EG x q pt induces a 
map //£(pt) — ► Hq{M) giving Hg(M) the structure of an H^ipt) -module. This leads 
us to the first version of the theorem of Atiyah and Bott. 

Theorem 6.2 Let Fix be the fixed point locus of a torus T action on M . The map 
ET x j Fix — > ET x j M induces an isomorphism 

t* Fix : H*(M) ® H * ipt ) F* H*(Fix) ® H?(pt) F* . 

Atiyah and Bott actually state a more refined theorem that specifies which elements 
need to be inverted in order to obtain an isomorphism. The proof of this theorem is to 
use the formula l* f lf\ = e(N(F)) to see that the maps ipix! and Q := Y^,f e(N(F)) WQ 
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inverses of each other. Here F represents a component of Fix and the sum is taken over 
all such components. 

In order to apply these ideas to the integration of ordinary cohomology classes on M, 
notice that the map j: M — > ET x T M taking x to the equivalence class of (eo,x) 
induces a map Hj(M) — > H*(M) via pull-back. A class in the image of this map is called 
an equivariant class. One standard way to construct equivariant classes is to start with 
an equivariant vector bundle E —> M and take characteristic classes. An equivariant 
bundle is just a vector bundle together with a torus action compatible with the bundle 
structure. To any such bundle one can associate the pull-back of the induced bundle 
ET x T E — > ET x T M. Any characteristic class of ET x j E is then an equivariant class. 
Given any equivariant class cf> £ H m (M) with equivariant lift q> £ Hf(M) we have 



(6) 4> = n M \<f> = n M \f$ = ^M\$ = ^M^Kx! ^2 

JM p 



l* f 4> 



,. e(N(F)) 

Lp4> f l* F (f> 



This is the standard way of thinking about the Atiyah-Bott localization formula. 



If e(N(F)) 



6.3 Equivariant cohomology of CP" 

To use the localization formula (6) to integrate a cohomology class <fi one must find 
an equivariant lift 4> tnat maps to (j). The standard way to do this is to express <f) as 
a product of characteristic classes of vector bundles and then extend the group action 
over the vector bundles. Such an extension is called a linearization of the action. For 
example, we can represent the hyperplane bundle over CP" by (C" +1 — {0}) x C/ ~ 
with (zo, • • ■ ,Z n , ~ (wzq, • • • , wz n , vv£). Define a family of linearizations of this 
bundle by A • (zq, ■ . . ,z n , '■= (AoZo, ■■■ , Kz n , Aq° • • ■ Xn'O- Denoting the hyperplane 
bundle with this linearization by L qo qn , we have 

ci(£§o,-,<?J = h~ <?oao q n a n ■ 

This serves to define equivariant cohomology classes of CP" . The class h is defined to 
be the first Chern class of the bundle Lo,. ..,(), an ^ a k i s defined as a difference of Chern 
classes. Let us outline a proof that 

tf*(CP") = ®[h, Q , . . . , «„]/ (f[(h - at) ) • 

\k=0 J 
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First consider a finite-dimensional model X p ^ for ET Xj CP P . If T = T n+l acts on 
CP P in the usual way then 

{([xol [x n ]; [y : . . . : y p ]) € (CP W )" +1 x CP^ 1 ^ 1 ^ 1 | A 2 [*] = 0}. 

A specific case helps explain what this means. Take n = 2, p = 2, and N = 4. We 
will write [* ] = [xq : x t : x 2 : x 3 : x 4 ] , [x\] = \y : . . . : y 4 ] , [x 2 ] = [zo ■ ■ ■ ■ : Za\ and 

\yo y2\ = [po : p\ : P2 ■ P3 ■ P4 ■ qo ■ q\ ■ q2 ■ q3 ■ q4 ■ r : n : r 2 : r 3 : r 4 ] 

The condition on the second wedge states that all 2 x 2 determinants pairing x's and 
p's etc are zero, so 



XQ 


x 3 


= o, 


Z2 


23 


P0 


P3 









= 0, . . . . 



To see where this comes from note that there is a natural action of S 1 on S 2N+l with 
quotient CP^. In the N -> oo limit we see that 5°° is contractible, so ES l = S°° and 
BS 1 = CP°°. Thus (CP^)'^ 1 is a finite-dimensional model for BT. The condition 
on the vanishing of the second exterior powers implies that yo is proportional to 
xq, etc. This means that the inverse image of a point under the natural projection 
Xp, n ,N -> (CP^)^ 1 is a copy of CP P . The class h is Poincare dual to 

H := {([x ], . . . , [x n ]; \y : . . . : y p ]) € X p ^ N \(x p ,y p ) = 0} , 

and the class a\ is Poincare dual to 

Ak ■■= {([xol, • • • , [x n ]; b>o ■ ■ ■ ■ ■ y P Y) e ^p,«,/v|fek = 0} . 

These formulas can serve as alternative definitions of these classes. The following 
exercise will prove that 

H* T (X P ,n, N ) = Q[h, ao, . . . , a n y (f[(h - a k , a$+ 1 ,. . . , c^ +1 ) j . 

\k=o J 

Exercise 6.3 Prove that the cohomology rings of X p ^ n ^ and CP' 1 take the stated form. 
Note for example that ^2,4 is a 2 8 -real-dimensional space with one 28-cell, no 
27-cells and \ = ^1,2,4 U (J/t=(At- ^ n f act ' ^ P ro P eri y taking complements of 
unions of intersections of the A# and H cycles, X p ^ may be decomposed into a union 
of cells. It follows that the cohomology group of X p ^ is as stated. The ring structure 
follows from the combinatorics of the intersections of the and H cycles. 



Qeometry & Topology Monographs 8 (2006) 



260 



Dave Auckly and Sergiy Koshkin 



Exercise 6.4 Let qk be the point in CP m with all coordinates other than z.k equal to 
zero. These points are fixed by the standard J-action, so there is an equivariant class 
<fik Poincare dual to q^. By considering appropriate intersections of the and H prove 
that 

4>k = \\(h - at/) . 
i¥* 

Let ik : pt — ► CP m be the map with image . We compute 

i\h = [ PD(q k )h= [ PD(q k )(h - a k ) + f PD{q k )a k = a k . 

JCP" JCP" JCP"' 

The above computation may look a bit weird. It appears that we are restricting a 2-form 
to a point and getting a non-zero answer. The thing to remember here is that we are 
working equivariantly so we replace qk by ET Xj qk and CP"' by ET x j CP m , and we 
have tensored with Q(ao> • • • > &m)- 

7 Localization computations of Gromov-Witten invariants 

As with many parts of this theory Kontsevich was the first to apply localization to 
Gromov-Witten invariants [90]. We are following the exposition from Hori et al 
[77] and Cox and Katz [43]. To see how this applies to Gromov-Witten invariants, 
notice that a T action on X will induce a T action on M. g n (X, f5) via composition 
A • [u, S,p] := [Am, S,p] . In order to better demonstrate how localization may be used 
to compute Gromov-Witten invariants we will recompute some of the Gromov-Witten 
invariants of CP . This is not a very efficient way to compute these invariants but it 
will allow us to explain the important points. 

We should point out that thus far our discussion of localization only applies to well- 
behaved spaces with torus actions. In the Gromov-Witten setting this will only occur 
when the automorphism group of every stable map in the relevant moduli space is trivial 
and the obstruction space over each stable curve is zero as well, that is, Aut([M,£,p]) = 1 
and ob([u, a,p]) = 0. When the obstruction spaces are still trivial but the automorphism 
groups are not, one must generalize the localization formula to the orbifold setting. 
When the obstruction spaces do not vanish, one must define virtual fundamental cycles 
and prove that localization works for these virtual cycles. In this section we will ignore 
the more technical points and discuss localization in the context of Gromov-Witten 
invariants as though everything were smooth and automorphism-free. Take heart, we 
will give an introduction to virtual fundamental cycles later in this section and (virtual) 
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localization really does work in this context as was shown by Graber and Pandharipande 
[66]. 

We have a standard T 3 action on CP 2 given by 

(A , Ai, A 2 ) • [za : zi : z.2] ■= [Aozo : Aizi : A2Z2L 

The first step is to find the fixed point set of the T action on M. gjn (CP , d). The subtle 
point is that if one looks for maps as opposed to equivalence classes of maps fixed by 
T there will be none (unless d = in which case a constant map with value a fixed 
point in CP 2 would be fixed). Thus one must remember to look for fixed equivalence 
classes. This means that given a stable map [u, and a A G T one must find an 
automorphism ip of the underlying marked surface so that Aw = u o ip. 

Example 7.1 An example of a genus zero degree three stable map fixed by the natural 
T action is [u, CP 1 ] where u([xq : xi]) = [xjj : x\ : 0]. This is a stable map because the 
only automorphisms are given by (p([xo '■ xi]) = [£xo : xi] where £ 3 = 1. It is fixed 
by every A G T because the map ip([xo : xi]) = [^o^o : induces an equivalence 
between [A ■ u, CP 1 ] and [u, CP 1 ] where £q = Ao and ^ = Ai . 

Remark 7.2 A stable map fixed by T has an infinite number of left symmetries. 
One should not confuse this with the requirement of at most a finite number of right 
symmetries from the definition of a stable map. The first is the T action by post- 
composition. The second is pre-composition by an automorphism of the marked 
surface. 

In general, a component of the fixed point set of the T action on M. g n (<CV 2 ,d) 
can be described by a labeled graph, denoted Y . The labels on the graphs and the 
correspondence between labeled graphs and components of the fixed point set will 
be described in Section 7.1. See Figure 7.1 for a labeled graph and element of the 
corresponding fixed point component. 

4 1 

5 2 

6 3 

7 8 

Figure 7.1: A labeled graph and element 
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Applying the localization formula (6) to the Gromov-Witten invariants of CP m gives 



(7) //,'. //«\CP m , -V — ! n ^ ^ {u(Pj}) 

The notations used in this formula are explained further in the next article. Here 
h is an equivariant lift of the class h. The standard lift is also denoted by h, in 
which case h\k( Uj ) = More generally, lifts of h take the form h + ^ ajccj and 

(h + Y1 a i a j)\k(uj) = a k(ui) + zC a j a i ■ We nave to divide each term by a group factor to 
take into account the fact that the moduli space is a stack (think orbifold). The group 
Ar is the automorphism group of a generic element of the fixed point set T. The 
automorphisms of the labeled graph Aut(T) act on the edges of the graph and therefore 
on the group \\ e r Ld(e)- The group Ar is defined as the following semidirect product: 



A r := Aut(T) * ( JJ: 



'd{e) 



Formula (7) gives the Gromov-Witten invariants of CP m for any genus. For genus zero 
one just has to include the automorphism groups; for higher genus one also has to apply 
virtual localization. Of course, there is still work to do to get numbers from this formula. 
In particular we still have to compute the Euler class e(N™) . 

The next step in the localization computation is the computation of the Euler class of the 
normal bundle to each component of the fixed point set. We have seen that the tangent 
space to the moduli stack at a stable curve is def([u, £,/?]). Assuming that [u, 
is a fixed point of the T action there will be an induced action on the tangent space. 
This linear space decomposes into a collection of irreducible T representations. The 
tangent space to the fixed point set at [u, £,/?] is the sum of the trivial T representations 
because the fixed point set is well, fixed. The normal space is therefore the sum of the 
nontrivial representations and is denoted by dtf([u, S,p]) mov . In fact, we will see that 
the T action extends to all of the spaces in the deformation-obstruction complex, so 
the representation def([u, S,p]) mov may be deduced from similar parts from the other 
terms in the deformation-obstruction complex. 

Recall that given an exact sequence of vector bundles 

-> Eq -> Ei -> • • • -> E 2n +i -> , 

one has the relation Y\k=o e (^2k) = Ylk=o e (^2k+i)- Applying this to the moving part 
of the deformation-obstruction sequence (5), we obtain 

e(0ef (u) mov )e(d ef([S , p]) mov )e(ob([u, S , p]) mm ) 



e(M([ M ,£,p]) mov ) 



e(aut([S,p]) mov )e(ob(M) mov ) 
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This last formula divides the computation of the Euler class of the normal bundle 
required for localization calculations into more tractable parts. Instead of analyzing all 
deformations of a stable map we are able to analyze deformations of the map that fix 
the marked curve, deformations of the marked curve, etc separately. This program is 
carried out in detail in Sections 8.1, 8.2, 8.3 and 8.4 where we compute e(out([S,p]) mov ) 
first, describe the torus actions second, compute e(3ef([E,/?]) mov ) third, and the ratio 
e(def(u) mm )/e(ob(u) mm ) last. The term e{ob{[u, E,p]) m0Y ) is addressed in Section 11. 
Before embarking on this program we collect all of the resulting formulas and provide 
two examples. 

7.1 Representation of fixed point components by graphs 

We now describe the correspondence between components of the fixed point set and 
labeled graphs. This article introduces the notation used throughout this and the 
following sections. Recall that the domain of a stable map is a prestable genus g curve 
with marked points. A stable map fixed by the standard torus action maps each node of 
the curve to one of the points qi := [0 : . . . : 1 : . . . : 0], which are the points fixed by 
the standard action on CP'" . Some irreducible components of the prestable curve are 
mapped entirely to a single qi . We call such components contracted components or 
ghost bubbles. Note that there are no non-constant maps of higher genus curves into 
CP m fixed by the standard torus action. Therefore if a component is not contracted it 
has to be a copy of CP 1 that is mapped onto a projective line containing exactly two of 
the qi fixed points. Notice that any two stable maps in the same component of the fixed 
point set have the same non-contracted CP 1 configuration. 

Thus we can describe a component of the fixed point set by a graph with edges corre- 
sponding to non-contracted components and vertices corresponding to the components 
of the preimages of the qi . Each edge e is labeled with a positive integer d(e) indicating 
the degree of the map and each vertex is labeled with one of the fixed points ■ In 
addition, marked points are listed in columns under the vertices (see Figure 7. 1). Finally, 
contracted components unlike non-contracted ones, can have higher genus and in 
principle this has to be indicated at the vertices as well. We adopt the convention that 
the absence of such a label corresponds to genus . In particular, in this subsection we 
are only concerned with the Gromov-Witten invariants of rational curves and no genus 
labeling is necessary. Table 7. 1 summarizes the notation used to label these graphs. 

Remark 7.3 One can visualize a flag by drawing an arrow on the edge; the source of 
the arrow along with the edge is the flag. Notice that this use of the term flag agrees 
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with the usual definition in terms of increasing sequences of subspaces (see Harris [73] 
and Griffiths-Harris [68]) because a vertex in the graph corresponds to a point in CP'" 
which is a 1 -dimensional subspace of C m+1 and an edge in the graph corresponds to a 
line in CP m which is a 2-dimensional subspace of C'" +1 . 

Example 7.4 For the graph in Figure 7.1 let v denote the leftmost vertex. Then 
val(v) = 1, d(y) = 1, n(v) = 4, k{v) = 0, k(v') = 1, g(v)=0, and Jfc(n(p 7 )) = 0. 



Notation 
val(v) 



n(y) 
k(v) 



g(v) 
d(e) 



k(u(pj)) 



Description 

The valence of the vertex v, that is, the number of edges incident to 
it. If val(v) = 1 we let v' denote the vertex on the other side of 
this single edge and if val(v) = 2 let vi and V2 denote the other 
two vertices on the two edges. 

The number of marked points listed under the vertex. Some 

authors draw graphs with 'legs' to indicate the marked points. 

The index of the fixed point in CP" 1 corresponding to the 

vertex, for example if v is labeled by qi then k(v) = i. 

The first Chern class of the line bundle associated to the dual of 

the representation given by projection to the kth factor of the 

torus. 

The genus of the contracted component associated to the vertex. 
We set g(y) = if there is no such component at the vertex. 
The degree of the map of the non-contracted component 
corresponding to edge e. If v has valence one d(v) will be the 
degree of the unique edge meeting v. If v has valence two d{e\) 
and d{ei) will denote the degrees of the edges containing vi and 
V2 respectively. 

A flag in the graph of a fixed point component, that is, a pair of a 
vertex and an incident edge. We will use the flag to denote the 
corresponding vertex or point on the prestable curve or image 
point in CP m without comment. 
The label image of the 7th marked point. 



Table 7.1: Graph labels 



We are now ready to present the formulas that are used in localization computations of 
Gromov-Witten invariants of CP'" . 
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Remark 7.5 We note that the index j in (1 1)— (13) runs over all possible values, not 
just the qj depicted on the graph. For example, if we are in CP 3 one should take into 
account terms with j = 3 even when ^3 is not on the graph. See Remarks 8.5 and 8.7 
in the proofs. 



7.2 Formulas used in localization 



(8) e(aut([£,/?]) mov ) = J] 



a*(v) — oik(v') 



d(v) 

val(v)=l 
n(v)=0 

g(v)=0 



(9) e(um,p]) m ° V )= TT f oc k(y) -a m) + a Kv) -a Kv2) 

11 \ die]) 



d(ei) d(e 2 ) 

val(v)— 2 

g(v)=o n (— ^ — **y 

1Q e(oef(w) mov ) _ e(//°(s,o g (^ M *rcp 2 )) mov ) 



e{ob{uT™) e ( H i o £ (^* M *rCP 2 )) mov ) n^(r„ (c) CP 2 ) ' 

(11) n < T <c)Cp 2 )) = n n - «/)• 

(12) e (//°(s,o( M *rcp 2 )) mov ) = 

1111 (««v) - aj) 1 1 d 7^M7) ( a *W - a«V)J 

(13) g(H 1 (s,c>(i/*M*rcp 2 )) mov ) = goo 

II II ^c^Kaw-ajr*. 

vai(v)+n(v)+2g(v)>2 j^kfy) i=Q 

(14) e(N r ) = e(dzf([u,Z,p]) mm ) = 

e(oef(w) mov Mi)ef([S,p]) mov Mob([M,S,p]) mov ) 
e(out([S,p]) mov Mob(M) mov ) 

(15) ( " -* Wp 1 ] - \ |Ar| 7r <**r) " 
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Remark 7.6 In the localization formula (15) the sum is taken over all labeled graphs 
with n marked points having the correct genus and degree. The integral is taken over 
the moduli space .Mr of all stable maps having the given graph. To evaluate these 
integrals one has to expand into an infinite series in the i/j classes and integrate 
the (finite number of) terms of top degree. Since the values of the integral are in the 
equivariant cohomology ring of CP m the Chern classes at play the role of constants 
and can simply be factored out of the integral. Thus, actual integration is only required 
when one of the descendant classes ipp from (9) is non-zero in which case we can 
use the string and dilaton equations of Section 4.3 to evaluate the integral. Sample 
computations in the next article will clarify the details and should convince the reader 
that it is possible to extract useful information from these cumbersome formulas. 



7.3 Small degree invariants of rational curves in CP 2 

In this article we demonstrate the localization formulas by computing the genus zero 
invariants in degree one, two and three. We start with 

where h is the Poincare dual to the hyperplane class. We can lift h to an equivariant 
class that we denote by the same letter. Intuitively, we are looking for the number of 
lines through two generic points in CP so you can guess that the answer is one, but it 
is instructive to see how localization produces this answer. 

We are working with fixed stable maps in .Mo,2(CP 2 , [CP 1 ]). Since the overall degree 
is one, our graph can only have one edge and it will be labeled with a 1 . The two 
vertices are labeled with two of the three points qo = [1 : : 0], q\ = [0 : 1 : 0], 
<?2 = [0 : : 1]. We also have to distribute the 2 marked points between the two 
vertices and there are two essentially different ways to do this: to place the marked 
points at the same vertex or at different vertices. This leads to two different labeled 
graph types. There are a total of 12 graphs to consider, 6 of type one and 6 of type two. 
Due to the obvious symmetry, contributions from all graphs within a type are similar. 

For the first type we consider the graph labeled by q$ and qi with the two marked 
points placed at go- The only automorphisms are trivial so |Aut(Er)| = 1 and since 
both marked points are mapped to go we have i*&v\h 2 = and L*ev\h 2 = a 2 ,. 

Following (8)— (14) we get 

e(aut([Z,p]r m ) = "^p = Q! - ao 
e(dcm,p]r m ) = 1 • 1 • ("o^i _ ^ = ao - ai 



Qeometry & Topology Monographs 8 (2006) 



Introduction to the Gopakumar-Vafa Large N Duality 



267 



Recall that the ^-classes are the first Chern classes of the tautological bundles. In this 
case the components of the fixed point set are points so all ^-classes vanish. 

l[e(T u(c) CP 2 )) = {ao-a l ){a -a2>{a l -ao){a l -a 2 )e{H\t, 0(u*TCP 2 )) mov ) 

(-\)\V.) 2 2i 
= (a -a\)(ao-a 2 ) ■ (ai— ao)(ai— ai) • ^ (ao— a\) (ckq— OL2)\a.\-ai) 

e(H l {Y>, 0{v*u*TCV 2 )) mow ) = 1, since all vertices have genus 0. 

In fact we could have seen that e(H l (E, 0(u*TCP 2 )) mov ) = 1 directly since the fiber 
of the obstruction bundle ob(u) is H^T,, E («*rCP 2 )) = 0. Using the deformation- 
obstruction sequence (5) this implies that e(ob(u) mov ) = as well. Continuing, 

g(def(M) ) _ _ ^ _ a )( a _ a2 ) 
<?(ob(w) ) 

CtQ — OL\ j 

e(N r ) = • -(a - «i) (a - a 2 ){a\ - a 2 ) 

OL\ — CtQ 

= (ao — ot\) (ao — ot 2 ){a\ — a 2 ) . 

The computation of the Euler class of the normal bundle for the second graph type is 
analogous and we leave it as an exercise. 

Exercise 7.7 Repeat the above computation for the one-edge graph labeled with go 
and pi labeling one vertex and q\ and p 2 labeling the other. You should get 

e{Nv) = -(ao - ai) 2 («o - a 2 )(ai - a 2 ). 

The Euler classes of the normal bundles for the other graphs can be obtained from the 
first two examples by symmetry considerations. There are no V'-classes to integrate so 
the integral in (15) can be dropped. It is convenient to first sum up all the contributions 
from graphs labeled with qo,q\- There are four such graphs and the remaining two can 
be obtained from the ones we already computed by switching ao and a\ . By (15) the 
contribution to A^i from these four graphs is 

(«o • «o + a i ' a i) 



(ao — ai) 2 (ao — ot 2 ){ot\ — a 2 ) 

1 9 9 9 9 

(a •«! + «!• a ) 



(ao — ai) 2 (ao — a 2 )(ai — a 2 ) 

(«o ~ a i) 2 



(ao — ai) 2 (ao — a 2 )(ai — a 2 ) 
There are eight more graphs to account for, four labeled with go 5 #2 and four labeled 
with qi,q 2 . The joint contributions of each four are obtained from the last expression 
by applying the obvious substitutions. 
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Exercise 7.8 Add up the three fractions and get A^i = 1 as expected. 

We can repeat the previous computation with different linearizations (lifts to Hj(CP 2 ) 
of the class h). For example, since h is the first Chern class of the 0(1) line bundle 
we can construct lifts as the equivariant Chern classes of the same line bundle with 
various group actions. The lift with the same name comes from the first Chern class of 
the standard extension of the T action on CP 2 to this bundle. We may take the tensor 
product with a trivial line bundle with the /ii -action to change the action. This does 
not change the Chern class but it does change the equivariant lift to h = h — a\ . If we 
use (h — a\)(h — aj) as the linearization for the cohomology class corresponding to the 
first marked point and (h — ao)(h — a-i) for the second marked point then only one of 
the 12 graphs will contribute to the sum. Namely, the graph with one vertex labeled 
with go and p\ and the other vertex labeled with q\ and p2- 

Exercise 7.9 Repeat the computation of 

using the (h — a\)(h — ai) and (h — ao)(h — a?) linearizations. 

Generally speaking components of the fixed point set of a torus action on a moduli space 
of stable curves can be expressed as a finite quotient of a product of moduli spaces of 
stable maps into the fixed point set of the target manifold. The best way to understand 
this is to work out some less trivial examples. 

To demonstrate a less trivial localization computation we will conclude the subsection 
with a computation of 

N 2 := {h 2 h 2 hW)^ [Cpl] 
and some of the terms from the computation of 

N 3 := (h 2 h 2 h 2 h 2 h 2 h 2 h 2 h 2 )%f [Cply 

It is convenient to use the linearization (h — ot\){h — ai) on the class associated to each 
marked point. This forces all of the marked points to be mapped to qo if a graph is 
going to contribute to the sum. 

The graphs that contribute to the N2 computation come in one of two types. We label 
these two types by 7(&o^i) and //(^o^i^2)- The corresponding graphs are displayed in 
Figure 7.2 along with the graphs that contribute to the ./V3 computation. 

Each graph type contains all of the graphs obtained by an admissible labeling. An 
admissible labeling consists of assigning each kj a value of 0, 1 or 2 such that one of 
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the kj is zero and no adjacent two are equal. In addition an admissible labeling includes 
an assignment of the five marked points to vertices labeled with a zero. The vertex 
labeled with kj is really labeled by ; the £ -labels just avoid the double index notation 
and this is consistent with our use of k in the formulas. 

Exercise 7.10 Compute the contributions to N2 of several graphs from Figure 7.2. 
You can check your answer with the results listed in Appendix B. Notice that the sum 
of all the contributions is equal to one as we outlined in Exercise 4.8. 

This last exercise may be difficult, but we feel that the interested reader will learn more 
by doing it than just reading the answer. We won't feel too guilty since the answer is in 
an appendix. We will provide more explanation for some of the contributions to Nj, . 

The graphs that can contribute to N3 come in one of four different types. These types 
are displayed in Figure 7.2. Once again each graph type contains all of the graphs 

mh) . — . nmhhh) • — ■■ — / — ■ 

k jfcj k ° kl kl ki 



Ii(k hk2) , , , iV(k hk 2 h) 

k k x k 2 




k\ 

Figure 7.2: Graph types 

obtained by an admissible labeling as described in the N2 case. This time an admissible 
labeling includes an assignment of the eight marked points to vertices labeled with a 
zero. 

As an example of this notation the graph and curve displayed in Figure 7.1 are of type 
III. 

Example 7.11 To be specific the graph from the figure is 77/(0102) with the marked 
points p^, ps, p(, and pi on the ko vertex. In this case the moduli stack of the fixed 
point component is given by 

Mr = M),5(pt,0) x !Mo,6(pt,0), 
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and the inclusion map : -Mr — * -Mo,s(CP 2 , 3[CP ! ]) is given by 

w, (EUE') |J (CP 1 x {1,2,3})/ ~,p' 3 ,p' 4 , p' 5 , p 2 , P3,P4,P5,Pe 

Pl =([0:l],l) 
4=([0:1],3) 

where ([1 : 0], 1) ~ ([0 : 1],2), ([1 : 0],2) ~ ([0 : 1],3) and the map u is given by 
u([xo : xi], 1) = [xi : x : 0], k([xo : xi], 2) = [xo : xi : 0], 

k([xo : xi], 3) = [xi : : x ], w|(eu£') = [1:0:0]. 

Applying formulas (8)— (14) to the labeled graph from Figure 7.1 gives 
e(N r )~ l = -\(a -aiy 4 (a -a2)~ 2 (ai-a 2 )~ 2 

I {ao-ai-ipiy 1 ■ I (a -ai-ifjiy\aQ-a 2 -ip2)~ 1 

Now consider the second integral that appears in the above expression. We compute 
(ai - ipi)~\a 2 - tp2)~ l 

[M ,„(pt,0)]™ 



,. I 00 \ / 00 

J[M ,MV" \ p=Q ) \ q=0 

p+q=n-3 J\M^(vt.0W" 



[M , B (pt,0)]™ 

-lyi-3 



= a l i a 2 l ^2 ( ja l p a 2 ' 1 = a l i a 2 l (a l l + a 2 l ) n 
p+q=n-3 V ^ ^ / 

The last line in this computation used the result from Example 4.9. 

The graph 77/(0102) has no nontrivial automorphisms and each edge has degree one, 
so this component of the fixed point set has only trivial automorphisms. Combining 
the two previous computations with the (h — ct\)(h — ai) linearization allows us to 
conclude that the contribution of the graph 111(0, 1,0,2) with the marked points p 4 , 
Ps , p(, and pi on the ko vertex is 

-|(ao - ai)~ 4 (ao - a2) 5 (ai - a2)" 2 ((«o - ai)™ 1 + (ao - a2) -1 ) 3 . 
Of course there are ( 4 ) ways to choose four marked points for the first vertex. This 
means that the contribution of such graphs is 

' (ao — ai) _J (ao — «2) z («i — a2)~ z (2ao — ol\ — a.2) 1 - 



2 \4 
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Exercise 7.12 Show that the contribution of the 1/7(0102) graphs with k marked 
points on the first vertex is 

\ 

j (qo - o>i)~ (ao — ct2) ~ (a i — a2)~ (2ao — a\ — ai) ~ . 
Conclude that the total contribution to N3 from all ///(0102) graphs is 

1 s. 1 y O y 1 S 

— j(ao — (ao — a.2)~ (a\ — 02)" (2ao — a\ — ai)~ (3«o — «i — 2a2) . 

Exercise 7.13 Explain why there is no graph corresponding to 7/(002). Explain why 
there is no graph contribution corresponding to 7(21). 

Exercise 7.14 Compute the contributions to of several graphs from Figure 7.2. 
Using some mathematical software it would be possible to push this computation all the 
way and get the answer ./V3 = 12. 

8 Derivation of the Euler class formulas 

In this subsection we derive the general formulas that were used to compute the various 
Euler classes used in localization computations. We derive the formulas for CP 2 ; 
however careful inspection shows that these formulas are valid for CP m as well. 

8.1 The Euler class of moving infinitesimal automorphisms 

Before computing e(aut([S, p]) mov ) we need to understand the bundle of automorphisms. 
First consider the automorphisms of a genus zero irreducible component with no marked 
points mapped into CP 2 by the composition of a degree d map from CP 1 to CP 1 and 
an inclusion of CP 1 into CP 2 . This is exactly the situation described in the following 
example. We assume that this component is attached to the rest of the stable map by a 
node at [1:0]. 

Example 8.1 The T-action on 0ef(«) is a good example to study the induced /-action 
on a space in the deformation complex. Consider the /-fixed stable map given by 
u([xq : xi]) = [xq : xf : 0]. The action of / on this is just by pointwise multiplication 
of the coordinates. However, as we saw in Example 7.1 it is not immediately clear from 
this description why this map is fixed. We would like to include the reparametrization 
demonstrating that the class of this map is built into the definition of the action. The 
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reparametrization requires a dt\\ root of elements of T, but this can not be done in a 
consistent way. This may be corrected by considering the representation where T acts 
by multiplication of dth powers of the elements of T. The first Chern classes of the 
line bundles associated to the irreducible factors of this representation are d times the 
first Chern classes of the original representation. It follows that we can get the Chern 
classes of the line bundles in the original representation by dividing by d. The induced 
action on a 1 -parameter family of maps (that is, a deformation) is then given by 

(A • u t )([xo : xi]) := 

[A^°(A " 1 ^o,A7 1 jci) : Af^(A l x , A^xi) : \(uj(X^ l x , X~ l xi)]. 

Expressed in this way it is clear why this action fixes the map u . 

One can now determine the r -action on out([S, p\), and apply it to a specific bubble. 
Recall that the map out([S,p]) — ► Def(w) is given by ip t \— ► u o <p t . Assuming that the 
domain of a stable map has a bubble with a node at [1 : 0] and this bubble is mapped 
into CP 2 by a degree d map (as described in Exercise 8.3) any automorphism must 
restrict to this bubble to a map of the form tp([xo : x\\) = [axo + bx\ : x\\. In order for 
the map aut([E,/?]) — > Oef(w) to be equivariant with respect to the dth power action, 
we must have 

(A • ipt)([x :xi]):= [a t x + AoA^^jci : xi] . 

A nice way to compute the first Chern class of the corresponding subbundle is to use 
the Borel construction. Recall that one can combine a principal J -bundle with a T 
representation to get a vector bundle, as explained by the following exercise. 

Exercise 8.2 Let L„ 0i .. v1l be the line bundle associated to the representation A • z = 
AS ... X n k k z and show that 

ci (£>!(,,...,„! ) = «oCi(£i,o,-,o) H h«feCi(Lo,...,o,i) • 

We conclude that the first Chern class of the corresponding subbundle is (tt[i : o:0] — 
ot[0:\:G\) / d , ak is the Chern class of the bundle associated to the divisor <pk described in 
the computation of the equivariant cohomology of CP" . 

A nice way to keep track of extra factors like 1 jd involved in computing the Chern 
classes is to introduce the notion of a virtual representation. Virtual representations are 
just Q-linear combinations of ordinary representations. The virtual representation on 
the automorphisms arising from the usual J-action is defined to be \ times the fifth 
power representation. 
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We can split the automorphisms of a marked curve into automorphisms of each 
irreducible component of the marked curve and compute the Euler class of the bundle of 
infinitesimal automorphisms from the automorphisms of the components. In fact we can 
split each factor in (14) into a sum of line bundles. The irreducible representations of T 
are all one-dimensional. These representations induce line bundles over the components 
of the fixed point set of the moduli stack. The Euler class of a complex vector bundle is 
just the top Chern class of the bundle (see Bott and Tu [30]) and the Chern class of a 
sum is given by 



To get to e(aut([S,p]) mov ) consider other irreducible components of the fixed stable 
map labeled by graphs as described around Figure 7.1. Each irreducible component of 
the stable map contributes a summand to the bundle aut([S,/?]) mov and hence a factor 
to the Euler class. The factor of e(aut(['E,p]) mov ) corresponding to an edge in the graph 
representing a component of the fixed point set that has two nodes or a node and a 
marked point is trivial. The induced action on the factors of aut( [£,/?]) corresponding 
to any contracted component (that is, one for which the stable map is constant) is trivial. 
The above discussion implies that 



8.2 The r -action on the deformation complex 

Before addressing the deformations of the underlying curve, we explain the T-action on 
all of the terms in the deformation complex in greater detail. We can use the homological 
algebra introduced to derive the deformation-obstruction complex to give a uniform 
treatment of the induced actions on the terms in the complex. 

Given an action T x X —> X and a stable map fixed by this action, u : S — > X one can 
construct the <ith power action and the corresponding virtual J -actions on X and X 
making u equivariant as we did in Exercise 8.3. 

Exercise 8.3 Let u : CP 1 U[o : i]=[i:0] CP 1 -> CP 2 be given by 

u([xo '■ x{\, 1) = {x d Q : x d[ : 0] and u([xo : X\], 2) = [0 : x^ 2 : xf 2 ]; 

the original action on CP is given by A • [zo : Z\ : Zi\ = [AoZo : Aizi; A2Z2] ; then the 
map u is equivariant with respect to the action on CP 1 given by 

A • {[xq : xi], 1) = ([Aq 2 ^o : A^xi], 1) and A • ([x : xi],2) = ([XqXq : A 1 1 xi],2), 



c top (E © F) = c top (E)c top (F). 



e(aut([Z,p]r m )= J] 



val(v)=l 
n(y)=0 
«(v)=0 



d(e) 
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the d\di power of the original action on CP 2 



Recall that the push-forward of a sheaf A under a map / : X — > Y is given by 
f*AQJ) := A(f~ l (U)). Let La represent left multiplication by A. We will consider 
the push-forward by L\ of various sheaves. There are natural transformations given by 
pull-back on the various sheaves, 

L^ : ^A* &x 

L*\ : Oe — >L\*0-£ 



The map in the first line is just the pull-back of holomorphic 1 -forms on X and the rest 
are analogous. The push-forward just formalizes the fact that the pull-back of a form 
over U is a form over L^~ 1 (U). Any of these pull-backs will induce a group action on 
the space of sections over any invariant set. In particular, they induce actions on the 
space of global sections. Our convention will be to use left group actions everywhere. 
Recall that a right action may be turned into a left action by taking the inverse of the 
group element. These actions will induce actions on the ext-groups. Recall that 

aut([E,/?]) = Ex^ E (Q E (p), O s ) Hom OE (f} s (/>), O e ). 

Given X G Homc> E (J7s(p), Os)» the left action is given by 

(A • X){6) := LUX(L* X 6)). 

Working in the ([xo : 1], 1) -chart we obtain 

(A • ((dx + b)d x ))(dx ) = L* x *((ax + b)d x (L* x dx )) = dx Q + X^X^b. 

Note that A • ([xo : 1], 1) = ([Xo 2 X~[ d2 ], 1) This agrees with our earlier computation of 
the action. 

Exercise 8.4 Check that the action on oef([S,p]) derived via homological algebra 
agrees with the action described earlier. 



8.3 The Euler class of moving deformations of the curve 

Following Cox and Katz [43] , we can use local models to analyze the L-action on 
oef([£,p]) mov . Most marked points are on contracted components. The deformations 
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corresponding to moving these points are tangent to the fixed point set of the action. 
The one exception is when there is exactly one marked point labeling a vertex of valence 
one. This corresponds to a marked point on one of the branch points of a degree d cover 
of a standard line in CP 2 . The space of deformations corresponding to moving such a 
point is trivial because a genus zero curve with two marked points has no deformations. 
All of the normal deformations arise from resolutions of nodes. The local model here 
is £ = {(x, y) G C 2 |xy = 0}. Let = (xy) be the sheaf of algebraic functions on 
C 2 containing a factor of xy (this is called the ideal sheaf) and notice that we have the 
following exact sequence, 

T E /2|->nJ, 2 | E ->n! 3 ->o. 

The associated long exact sequence of ext-groups reads 

Ext<y^ 2 | E , e> E ) Ext^ s (j s /J 2 , e> s ) 

Ext^L C> s ) Ex4 B (^ 2 | s , Cs) • • • ■ 
Now, |s is a free Os -module, so Extjj (fi^ls, Cs) = 0. It follows that 
Ext^L C> s ) coker(Ext^ E (^ 2 | s , C s ) Extg, E (J ff /2|, S )) 

7b(C x 0) e r (0 x o . 

These local results can be combined to give cief([S,/?]) mov . We first introduce or recall 
some notation. 

Let F refer to a flag in the graph of the stable map (that is, a pair consisting of a vertex 
and incident edge). We will use the flag to denote the vertex or the point on S or the 
point on CP 2 corresponding to the vertex without comment. The normalization of the 
surface is denoted by £ and the node branches will be denoted by b\ and &2 - Recall 
that Cu denotes the line bundle over the moduli space whose fiber is the cotangent space 
of the corresponding curve. 

Combining these local results gives, 

dcm,p]) mm = ( (^s e r^E)) ( (7>£e£*)Y 

\al(v)=2 ' \nl(F)+n(F)+2g(v)>2 ' 

n(v)=0 
S(v)=0 

This formula translates directly into a formula for the Euler class, 

= ( n r%T' l) + ak %7 2) )) 



x val(v)=2 
n(v)=0 
«(v)=0 



( n R^-*)> 

V val(F)+n(F)+2g(v)>2 
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8.4 Euler class associated to the map 

We now turn to the computation of the Euler classes of oef(w) mov and ob(w) mov . The 
following exact sequence serves to define holomorphic functions on a nodal curve in 
terms of holomorphic functions on the normalization v. E — > S : 

o -»■ o s -► i/*0 £ -> © c e> c -> o. 

Here we use c to denote the nodes (crossings) of E . We have a related exact sequence 
for holomorphic sections of the pull-back bundle u*TCP 2 : 

o -> o E («*rcp 2 ) -» i/*Og(i/*«*rcp 2 ) -> © f r„ (c) cp 2 -» o. 

The associated cohomology long exact sequence reads, 

o -> a°(e, o s (w*rcp 2 )) -> o g (i/*M*rcp 2 )) 

© c h°(E, r u(f) cp 2 ) -> //^e, o s ( M *rcp 2 )) 

-» /Ae, Og(^* M *rcp 2 )) -» ©^(E, r H(c) cp 2 ) = o. 

Notice that the fibers of Def(w) and ob(a) over [u, E,p] are H°(E, O s (m*JCP 2 )) and 
H^SjOs^rCP 2 )) respectively. There are analogous sequences associated to any 
point [u', E',p'] in the moduli space and the spaces in these sequences glue together to 
define J-equivariant vector bundles over the moduli space. This is covered in more 
detail in Section 9 below. The sequence of linear spaces generalizes to a T-equivariant 
sequence of vector bundles. This implies that 

e(uef(M) mov ) _ ^//°(E,0g(i/*w*rCP 2 )) mov ) 
e(ob(u) mm ) e (//i(E, 0g(i/* M *rCP 2 )) mov ) U c e(T u{c) CV 2 ) ' 

Recall that we are computing equivariant Euler classes here. It turns out that the 
possibility of a nontrivial bundle with nontrivial action does not arise in our computations. 
Thus, there are two different cases that we need to analyze. The first case is when 
the bundle is possibly nontrivial and the action is trivial. In this case the equivariant 
Euler class is just the usual Euler class. The second case is when the bundle is trivial 
and the action is nontrivial. In this case, the action on a vector space induces a bundle 
over ET x T V and the equivariant Euler class is just the product of the Chern classes 
associated to the irreducible representations. Recall that ctk is the first Chern class of 
the bundle associated to the dual representation to the projection to the &th factor \i\ . 

The standard action of T on CP 2 may be expressed as A • {z\,Zi) = (AiAqZi, A2A0Z2) 
in the {z\ , Z2)-chart. It follows that 

r[i:0:0]CP 2 = fJ,() <g> O* © jU|) and dT[l : 0:0]CP 2 ) = (« ~ a\)(UQ - « 2 ), 
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so 

He(T U(c) cp 2 )) = n n (.<*m - «/)• 

c F j^k(F) 

Remark 8.5 The number of nodes at a vertex with n(v) = g(v) = is one less 
than the valence of this vertex, so when we take this product over all flags we should 
remove one flag from each such vertex. However, it is easier to include all flags 
here and include canceling terms in the expression for e(H°(E, 0(u*TCP 2 )) mm ) (see 
Remark 8.7). We abuse notation by not introducing new notation for these modifications 
to n c ^ M(c) CP 2 )) and e(H°(% 0(w*rCP 2 )) mov ). 

The space H°(E, e>gO*w*rCP 2 )) mov splits into a sum of terms corresponding to the 
components of S. Clearly the bundle u*TCP 2 is trivial over contracted components. 
Thus the contribution to the Euler class from the contracted components is 

n n (a ^) ~ a i ) - 

val(v)+n(v)>2 j^k(v) 

To compute the contribution to the Euler class of the non-contracted components we 
will use the Euler sequence described in the next exercise. 

Exercise 8.6 The subbundle of T(C 3 — {0}) generated by Zkdk\(zo, Z\ , z%) is invariant 
under the multiplicative action of C x . It therefore induces a bundle, say L, over the 
quotient CP 2 . Check that the following is an exact sequence of vector bundles (it is 
called the Euler sequence): 

-» L -> C 3 TCP 2 -> 0. 

The Euler sequence leads to the following sequence of sheaves over CP 2 : 

-» cp2 -» Oil) C 3 -> TCP 2 -> 

As a representative model of the non-contracted components, consider the map u : CP 1 — > 
CP given by u([xq : x{\) = [xq : x\ : 0]. Pulling back to CP 1 via u and taking 
cohomology gives 

-> H°(CP\ O cp i) -» H°(CP l , 0(d)) ® C 3 

-» H (CP\O(u*TCP 2 )) -> ^(CP 1 , C CP i) -»■ • • • . 

Recall that H (CP\O(d)) can be identified with the space of degree d homogeneous 
polynomials in xq and x\ . Each of the terms in the last sequence is J-equivariant. 
Recall that the J-action on C 3 is given by A • z = (AoZo, AiZi, A2Z2) and the maps 
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[zo '■ Zi] i— ► [Zq 1 ' : J-j] generate all degree d meromorphic functions on CP 1 . As a 
J-representation, we have 

h\cp 1 , 0{d)) ® c 3 = (0 ® M f /d ) ®( W e w eM 

Thus taking equivariant Euler classes gives 

2d. 

e(H (CP\O(u*TCP 2 ))) = II n(-T«o + =Ui - a k 



k=0j=Q 



n - ^0) n ( V a ° + J -/ ai ) n( V a ° + ^ - ai ) 

(-i)V) 2 , M^rfd-j , 7 



(ao-«i) lH k -^-«o + ^ai-a 2 J. 

7=0 

In general we have 

^ o (s,0( M *rcp 2 )r ov ) = n n ^ - 

V #*(v) 

111 d{e) We) ^ m -a k(vl) ) >[[[ —OW+— a Kvl) -aj) 1 . 

j/*(v),*(v') 

Remark 8.7 The first product in this expression should be over all v such that 
val(v) + n(v) + 2g(v) > 2 because these are the vertices with contracted components. 
However, taking the product over all vertices exactly cancels the extra terms introduced 
in n c e(T u{c) CP 2 )) in Remark 8.5. 

We have one remaining term to compute to finish our computation ( 10) of the equivariant 
Euler characteristic of the normal bundle to the fixed point set, namely 

e(// 1 (E,O(i/* M *rCP 2 )) m0V ). 

Notice that the Kodaira vanishing theorem implies that the cohomology corresponding 
to non-contracted components vanishes, giving 

H\E,0(i/*u*TCP 2 )) ^ ® v H l (K,0(v*u*TCP 2 )). 

Now 

H\%,, 0{v*u*TCP 2 )) * H\%,, Sv )) ® T U(V) CP 2 
and H\X V , C Sv )) * (fl°(E v , Si ,))) v = E v . 
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(Recall that E is the Hodge bundle.) Putting this together gives 
H\%, 0(u*u*TCP 2 )) 9* E v r H(v) cp 2 

^ e v ® (m ® n* Q e ^2 ® no) 

= E v <g> m <g> nl © E v ® ii 2 <8> Hi- 
Here we are assuming that u(v) = [1 : : 0] . 

Exercise 8.8 The splitting principle states that any formula for characteristic classes 
that is valid for sums of line bundles is valid for arbitrary bundles. Use the splitting 
principle to prove that 

r 

e(E (g> L) = ^aWdiLy-', 
i=0 

when E is a rank r vector bundle and L is a line bundle. 

It follows that 

e{H\% 0{u*u*TCP 2 )) mov ) = 

11 11 ^ Ci -(E v )(a, (v) -a^-''. 

val(v)+n(v)+2g(v)>2 j^k(v) i=0 

Remark 8.9 We derived the formulas for the factors of the Euler class of the normal 
bundle to the components of the fixed point set for CP 2 . However careful inspection 
shows that all of these formulas are valid for CP" without modification. 



9 The virtual fundamental class 

There is one new ingredient that arises in this computation. Up to now we have 
considered intersections in general position. It is still possible to do intersection theory 
when intersections are not generic. We now describe this non-generic intersection 
theory. 

Let 7T : E —> X be a vector bundle and let ao be the zero section. Let F be a subbundle 
of E and a: X —> F be a generic section of F. We may also consider a as a section of 
E, but it will not be transverse to the zero section. Let Z := cr _1 (°"o) and consider the 
following exact sequence of bundles, 

► TZ ► TX\ Z da > E\ z ► F ± \ z ► 
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Since £ is a vector bundle, we have a natural map E — > TE. We also have a map 
dao : TX — > 77?. One can check that r.E| CTo (x) = dao(TX) © £| CT0 (X). The map labeled 
by Jct in the above sequence is the projection of the push-forward to E\ ao (x)- The 
bundle F L \z is called the obstruction bundle. It is usually denoted by ob(Z). Notice 
that this agrees with our earlier description of the obstruction bundle. If the section a 
were generic then the fundamental class of Z would be the Euler class of E. We define 
the virtual fundamental class of Z (denoted by [Z] vir ) to be the the Poincare dual of the 
Euler class of E. We have 



[Z] vir = PD(e(£)) = PD(e(F) U e(F^)) 

= PD([Z] U e(F L )) = [Z] n e(F L ) 
We now consider an example of the virtual fundamental class. 



[Z] n«(ob(Z)). 



Example 9.1 Consider the self-intersection of a line in CP 2 . The usual way to 
compute this is to perturb one copy of the line and then take the intersection, but this 
is not necessary. Let L\ = CP 3 -{[0:0:0:1]} with projection L\ — > CP 2 be 
the Chern class 1 line bundle over CP 2 . Setting o\ : CP 2 — > Li to be the section 
o"i([x : y : z\) = [x : y : z '■ x], we see that a^ l (ao(CP 2 )) is just a standard line. The 
self-intersection of this line is just the zeros of the section a = a\ © o\ of L\ © L\ . 
In this case it is easy to see that a takes values in the diagonal L\ subbundle and is 
transverse to the zero section of this subbundle. This is exactly the situation described 
above, so we have 



^cp 1 ncp 1 ) = f l = f l 

=/ 

JCF 



^obCCP 1 )) 



n 



CP 1 



CP 1 



1. 



This same behavior happens in computations on many moduli spaces. Our main example 
is the large Af dual of the 3 -sphere. 

Definition 9.2 The local P l or small resolution of the conifold is denoted by X S 3 or 
O(-l) © O(-l). It is the total space of the O(-l) © O(-l) complex vector bundle 
over CP 1 . We have, 

X s3 := ((bo : zi],wo,w u w 2 ,w 3 ) G CP ! xC 4 det ( W ° Wl ) = det ( Wl W3 ) = o) 

[ \Z0 Zl J \ZQ Zl) J 



The restriction of the symplectic form on CP 1 



is a symplectic form on X S 3 . 



Let qi : X S 3 — > CP 1 and q2 : X S 3 — > C 4 be the projection maps. Clearly these maps 
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are holomorphic. Given a holomorphic map u : E — ► X S 3 we see that q2 o u is 
holomorphic, therefore constant (see Exercise 9.3.) Let (w\ , vt>2, W3, W4) be this constant 
value. For positive degree d the map q\o u must be surjective. Taking xq 6 E with 
gi o m(jto) = [0:1] we see that wo = W2 = 0. Taking x^ G E with q\ o w(xoo) = [1:0] 
we see that w\ = W3 = 0. 

Exercise 9.3 Recall the definition of the d operator from Section 3. Show that du and 
du are perpendicular, and that 

(|<9w| 2 — |<9w| 2 )<ivoi£ = —g(du,Joduoj))dvolj; = 2u*u>x- 

Conclude that 

- / \du\ 2 Jvols = / |5w| 2 Jvols+ / u*u>x- 

2 JT, iS JT, 

When u is holomorphic, E is closed and u*lux is exact this implies that u is constant. 

Let (Jo : CP 1 — > X 5 3 be the zero section. Our previous discussion implies that the 
induced map 

cto: M g ,o(CP\ d[CP 1 ]) ^M g:0 (X S 3,d[CP 1 ]) 
is an isomorphism of stacks. According to Exercise 5.25, 

virdimCM^o^, ^[CP 1 ])) = 0, 

and 

virdimCM^oCCP 1 , ^[CP 1 ])) = 2g - 2 + 2d . 

We conclude that if the moduli space AigfiiCP 1 ,d[CP 1 ]) is unobstructed, then we 
are in exactly the situation described by the excess intersection formula. In Section 5, 
we claimed that homological algebra would allow one to extend the spaces from the 
deformation-obstruction complex to bundles over the moduli space. This is similar to 
the situation encountered earlier of glueing the spaces T^E into the bundle Ck over the 
moduli space. We use the universal curve over the moduli space here as we did before. 
This time we need a construction from homological algebra called higher direct image 
functors. 

Given a left exact functor F : C V and an injective resolution A — > 1° — > I 1 — > • • • 
one defines the right derived functors of F applied to A (denoted R*(F)(A)) to be the 
cohomology of the complex F(I°) — > F(I l ) Given a map / : X — > Y and a sheaf 

A over X one defines the direct image sheaf f*A over Y by f*A(V) := A(f~ l (V)). 
The higher direct image functors are just the right derived functors of the direct image 
functor. Hyper-derived functors are a generalization of derived functors applicable to 
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complexes of sheaves. One takes injective resolutions of each sheaf in the complex and 
applies the functor to obtain a double complex. The hyper-derived functor is just the 
cohomology of the corresponding total complex. Hyper-higher direct image functors 
R*(/*) of complexes may be defined via the total space of the resulting complexes in 
the same way one defines the hyper-ext functors. 

We will apply the hyper-higher direct image functor to sheaves arising from vector 
bundles. To any vector bundle we can associate the sheaf of sections. The sheaf of 
sections of a holomorphic vector bundle over X is a locally free, finite rank sheaf of 
Ox-modules. Conversely, given a locally free, finite rank sheaf £ of Ox-modules one 
defines an associated holomorphic vector bundle. Namely, there is an open cover {U a } 
of X and isomorphisms ip a : Ox(U a ) n — > S(U a ). Define ib a p := ip~ l o ipp on the 
overlaps and E :=_LL U a x C"/ ~, where (a,x,z) ~ ((3,x, tp/3 a (x)z). 

Exercise 9.4 Show that T(E) is naturally isomorphic to £ . Show that 

We conclude from this exercise that the fiber of E over a point xo may be identified 
with £ ® 0x (Cxo/m^). 

Now consider the universal curve over the moduli space. We have the vertical bundle 

V ► U x M g , n (X, f3) 

with sections pt : M. g>n (X, (3) — > Ux and the diagram 

ev* x TX > TX 

v 



7TX 

V 

M g , n (X,P) 

Let C-p be the line bundle associated to the divisor 

- Pl (M g ,„(X, f3)) Pn(M g ,„(X, (3)) . 

Let s be a section of this bundle with simple zeros along the divisor. Define V- p to be 
the bundle V <8> and a bundle map ev*(— (g) 5) : V- p — > ev x 7X. 
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Definition 9.5 The bundles in the deformation-obstruction sequence of a stable map 
are naturally associated to the sheaves given by: 

BefOO := R°(vr*)(ev*7X) 
ob(u) := RV*)(ev*ZX) 
out([E,p]) := R (7T*)(V 
Dcf([S )jP ]) := RV.XV 
autfl>, £,/>]) :=R°(7r*)(V 
i>ef([a,E,/>]):=R 1 (7r.)(V 
ob([ M , £,/?]) :=R 2 «)(V 

Before explaining why these bundles over the moduli stack have the required fibers we 
should explain exactly what the various maps actually are in this setting. In the above 
description we just treated stacks as spaces. The easiest way to understand a stack 
at this point is to use the definition that the moduli stack is the contravariant functor 
from the category of schemes to sets that associates the set of all equivalence classes of 
families of stable maps over a scheme to a scheme. 

In short, a stack is just the collection of all families of stable maps over a 
scheme. 

Several observations will clarify a correct way to think about these constructions. The 
first observation is that the coarse moduli space is the set that the moduli stack associates 
to a point, that is, A4 g> „(X, /3)(pt) = M gjn (X, (5). The second observation is that the 
universal curve over M. g , n (X,(3) is just Ux = M gt n+i(X, 0) even when the stable 
maps have nontrivial automorphisms. The map 

tt x : U x ^M g , n (X,P) 

is just the natural transformation of functors that takes a family of (n+1) -pointed 
curves to the family of ^-pointed curves obtained by ignoring the last point (section) 
and stabilizing the fibers. The third observation is that the stack associated to a space 
just consists of all maps from schemes into the space, and the map evx : Ux —> X is 
just the natural transformation that takes a family [w : W — ► X, W — > 5, p] to the 
map w o p n+ i — > X. The example in Section 10 will continue with the idea that the 
moduli stack is the collection of families of stable maps. The following exercise is good 
practice translating constructions into families. 

Exercise 9.6 Give a definition of the vertical bundle 

V ► U x M g , n (X, (3) 
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by making V a stack that associates appropriate families of vector bundles over families 
of stable maps. 

We now describe why the fibers of the bundles in the deformation-obstruction sequence 
are the expected spaces. To apply Exercise 9.4 to the sheaves of the deformation- 
obstruction complex, we need a theorem of Grauert (see Hartshorne [75, page 33]). 
Recall the definition of flat morphism from Section 5.2. 

Theorem 9.7 If f: X — > Y is a flat morphism, T is a coherent sheaf over Y and 
dim 0v / mv H k (Xy, !F y ) is constant, then ^(/^(J 7 ) is locally free of finite rank and 

R k (f*)(F) ®o Y {Oy/m y ) H k (X y ,F y ) , 
where X y is the fiber over y and T y = !F\x y ■ 

This theorem gives us a good way to think about higher direct image 
functors - under nice conditions the higher direct image functors associate 
to a family of spaces over 5 a vector bundle over S with fiber equal to the 
cohomology of the fiber in the original family. 

Notice that the condition that H k (X y , T y ) have constant dimension is not satisfied in 
cases of interest to us. For example, a degree three map from a surface of genus 
three to Xp has no nontrivial infinitesimal automorphisms of the underlying surface. 
However there is a nodal surface in the same moduli space consisting of the one point 
union of a surface of genus two and a surface of genus zero mapped by degree two 
on the genus two part and degree one on the genus zero part. This nodal surface has 
a two-complex-dimensional space of infinitesimal automorphisms. The next exercise 
compares the virtual fibers of the bundles from the deformation-obstruction bundle 
complex with the spaces from the deformation-obstruction complex. 

Exercise 9.8 Given an injective resolution Oy, — ► 1° — » I 1 — ► • • • , show that 
u*TX —> Hom0 E (w*$7x) /°) — ► • • ■ is an injective resolution of u*TX. Conclude that 

« 1 (7r*)(ev*ZX) 8>o/m (C/m) ^ H\Z,u*TX) = Ext\u*Q x , O e ) . 
Repeat this computation with the other bundles. 

Assume for the moment that the fiber 

Ext 2 (u*n x , 0s) = H 2 (S, TS -» u*TX) 

has constant dimension as [u, S] varies in MgfliXp, d[CP 1 ]). (Here M k represents 
hypercohomology.) Then ob([u, £]) = R 2 (7r*)(V — > ev*TX) satisfies the assump- 
tions of Grauert's theorem and therefore satisfies the assumptions required for our 
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definition of the virtual fundamental class. Using the map gq : 
M gfi (X S 3,d[CP 1 ]), we write 



1 



{MgAXa ^[CP 1 ])]* 



[M s ,o(X s 3,d[CP [ ])] 



e(R 2 (7r*)(V -> ev*TX)) 
cj*e(R 2 (7r,)(V -► ev*TX)) 



[M g , (CP l ,d[CP 1 ])] 

e(o-* R 2 (ir*)(V ->■ ev*7X)) . 

We continue with the change of base theorem for higher direct image functors. See 
Hartshorne [75, page 255]. 

Theorem 9.9 Given a commutative diagram, 

W 




Z > Y 

u 

with u flat and a complex of coherent sheaves A one has 

u*R*(f*)(A) = R*(g*)(v*A) . 

We conclude that 

a *R 2 (7rf)(V -> ev*TX) = R 2 (^ Cpl )(«r*V - ^evjTX) . 

Here do : W CP i — > Ux is the natural map. Since <7o o ev CP i = ev^ o do, we know 

R 2 (ir? l )(r Q V -> ^evjTX) = RVP'X^V - ev* pl ^7X). 

To go further, consider the exact sequence of bundles, 

ev^.rCP 1 -> ev* pl a57X ev* pl (0(-l) 0(-l)) 0. 

By the horseshoe lemma from homological algebra there are injective resolutions 
ev^rCP 1 -» /*, ev^^TX -> 7* and ev* pI (C(-l) © O(-l)) that form a short 
exact sequence of complexes 







J* 



K* 



0. 
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Exercise 9.10 Let 0— >A^B^C^0bea short exact sequence of modules, 
and let A — > /* and C — > K* be injective resolutions. Show that there are maps 
e: B^I°®K° andd k : I k ® K k ^ I k+l ® K k+l such that B -» /*© is an injective 
resolution that fits into a short exact sequence of complexes (see Weibel [156]). 



Let V — > L* be an injective resolution and form the following short exact sequence of 
complexes, 




Applying the direct image functor 7r!p p ' to each term and writing out the associated 
long exact sequence of cohomology groups produces a long exact sequence containing 



-> RV*)(ev* pl (0(-l) © 0(-l))) -> R 2 (vr*)(ev* pl V -> ev^TCP 1 ) 
-> R 2 (7r*)(ev* pl V ev* pl ^7X) -> R 1 (^)(ev^. pl (0(-1) © 0(-l))) 

Now look at the deformation-obstruction sequence for MgfldCP^dlCP 1 ]), 



0. 



— > auttfv, £]) — > aut([S]) — ► Def(v) — > Def([v, £]) — ► 0ef([E]) 

— ► ob(v) — ► ob([v, £]) — ► 0. 

The fiber of ob(v) is H l (Y,, v*TCP l ). For every stable map in the genus zero case 
this cohomology group is zero (this property is called convexity). For CP 1 convexity 
follows from the Kodaira vanishing theorem (see Griffiths and Harris [68]). This implies 
that 

= ob([v,S]) = R 2 (7r*)(ev^ pl V -» ev^TCP 1 ), 

so that 

R 2 (7T*)(ev^. pl V -» ev* £pl a* TX) * R 1 (vr,)(ev^ pl (0(-1) © O(-l))), 
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by the previous exact sequence. Now, 

dimctf '(E, v*(0(-l) © O(-l))) = 2d + 2g - 2 

by the Riemann-Roch theorem together with the Kodaira vanishing theorem. Since this 
is constant, we conclude that the excess intersection formula holds and 

N g , d = [ c 2d+ 2,-2(R 1 (vr*)(ev^ pl (0(-l) © O(-l)))). 

./[.M^oCCpVpCP 1 ])] 

The argument we used to get here was valid for the g = case. In fact, one can 
show that this formula is valid for all genera. When the dimension of the virtual 
fibers of ob([u, £,/?]) is not constant, one can still define a virtual fundamental class. 
This was done independently and almost simultaneously in the spring of 1996 by 
Fukaya-Ono, Hofer-Salamon, Li-Tian, Ruan and Siebert, see Salamon's lectures 
[134], 5 and references therein. All of the authors originally worked in a symplectic 
setting. Our description is closest to Li-Tian [98] who later extended their results to an 
algebraic setting [97]. See also Behrend [29], Cox and Katz [43] and Liu [99]. 

Virtual fundamental classes are important in the computation of degree zero Gromov- 
Witten invariants as well. The next exercises address this situation. 

Exercise 9.11 Show that the obvious map gq\ Mg, n x X —> ■M. gl n(X,0) is an 
isomorphism of stacks. 

Exercise 9.12 Let ttm '■ M.g,0 xX^ M. g) n and ttx : M. g ,o x X —* X be the projection 
maps and show that 

/ 7= I c top (w* M E v (8)7r|7X)U(7S7- 

i[M s ,„(X,0)]™ J[Mg,„] m 'xX 

Exercise 9.13 Use the splitting principle to show that dimc^ = 3 and c\(TX) = 
imply 

N gfi (X):= [_ 1 =(-l)* x (X) /_ c g _!(E) 3 /2. 

J[M gfi (X,0)] v " : JlM g , ]™ 

10 The multiple cover formula in degree two 

In this subsection we describe how to compute the Gromov-Witten invariants of the 
manifold X S 3 . We begin with a direct computation of 

N , 2 (X) := (l>o )2[C pi r 
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A localization computation of this same number is presented in Cox and Katz [43]. We 
begin by analyzing the corresponding coarse moduli space. 

Recall from the previous subsection that any stable map into X s i factors through 
CP 1 , so the moduli stack of stable maps to X S 3 is isomorphic to the moduli stack 
of stable maps to CP 1 . Given a stable map, one can construct a graph with vertices 
corresponding to the maximal contracted components, edges corresponding to the 
non-contracted components, and labels corresponding to the marked points, images 
of the contracted components, genera of the contracted components, and degrees of 
the non-contracted components. Since the curves in our case have genus zero, the 
corresponding graphs must be trees. Computing the Euler characteristic of the resulting 
graph gives 1 = 5^ v (l — ^valence(v)). It follows that we must either have a vertex 
of valence zero or two vertices of valence one. In order to be stable the map must 
have positive degree on each vertex with valence less than three. It follows that the 
only stable curves in the genus zero degree two moduli space have domain CP 1 or two 
copies of CP 1 joined by a single node. 

Now consider a degree two holomorphic map u : CP 1 — ► CP 1 . Locally such a map has 
a power series representation, so it must be a branched cover. This implies that 

2 = xCCP 1 ) = 2XCCP 1 ) - J2 ( 2 - K'OOl) = 4 - \S(u)\ . 

yeS(u) 

It follows that such a map must have exactly two critical points and two critical values. 
We will use the critical values to parametrize these maps. Notice that pre-composition 
with a linear fractional transformation cannot change the locations of the critical values, 
so two maps with different critical values are different. Given any two distinct points p , 
q in CP 1 we can take a linear fractional transformation taking [0 : 1] to p and [1 : 0] 
to q. Composing the map [z : w] i— > [z 2 : w 2 ] with this linear fractional transformation 
gives a degree two map with the desired critical values. Notice that this is not an 
equivalence of stable maps because such equivalences must be by pre-composition. We 
will map a nodal curve to the image of the node. 

Exercise 10.1 Show that two stable maps with the same critical values are equivalent, 
and show that the automorphism group of any stable curve in this space is Z2 . Conclude 
that the coarse moduli space is isomorphic to the symmetric product of two copies of 
CP 1 . (This is a reasonable example to use to understand the Gromov topology, or the 
algebraic structure of the coarse moduli space.) 

The second symmetric power of CP 1 may be identified with the space of degree 
two polynomials up to scale. One associates the roots of the polynomial to the 
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polynomial. This gives the isomorphism Sym 2 CP 1 — ► CP 2 taking ([zo : zi], [wo : wi\) 
to [ziw\ : — zo^i — z\wo : zo^o]. The same map is an explicit isomorphism 

$: MofidCP 1 ^[CP 1 ]) = CP 2 , 

when the singular set of the map is ([zo ■ Zi], [wo : wi]). The boundary divisor is just 
D = {[a : b : c] £ CP 2 \b 2 - 4ac = 0}. 

At the level of stacks the universal curve is just the moduli space Mo^iCP 1 , 2[CP 1 ]) 
with evaluation as the map to CP 1 and projection as the map to M.o,o(CP l , 2[CP X ]). 
It is interesting to compute the corresponding coarse moduli space. We claim that 
Mo.iCCP^tCP 1 ]) is isomorphic to CP 2 x CP 1 with isomorphism taking [u,T,,p] to 
(3>(m), u(p)). Given a point in CP 2 x CP 1 one can take a stable map corresponding to 
the CP 2 -component and then pick a point in the inverse image of the CP 1 -component 
as the marked point. Such a point exists because the map has degree two. (If the 
inverse image is a node, we add a ghost bubble containing the marked point at the 
node.) If the CP 1 -component is one of the critical values there is a unique choice for 
the marked point. Otherwise there are two possibilities. However one finds that the 
resulting marked stable curves are equivalent. For example, the point [0 : 1 : 0], [1 : 1] 
gives a stable map with critical points [0:1] and [1:0] (the roots of bzoZi =0.) 
This map is just [z : w] i— > [z 2 : w 2 ]. The inverse image of [1 : 1] is just [±1 : 1]. 
The reparametrization [z : w] i— > [—z '■ w] takes one to the other. Most points in 
A/^iCCP , 2[CP ]) have trivial automorphism group. This is because the underlying 
stable map has automorphism group Z2 and only the trivial automorphism will fix the 
marked point unless the marked point is at a critical value. It follows that the points 
along the divisor 

Di = {[a : b : c], [z : w]\az 2 + bzw + cw 2 = 0} 
have automorphism group Z2 . 

For the universal curve U — > M one should have that the inverse image of a point 
so e M. is isomorphic to sq ■ This fails with the coarse moduli spaces. The inverse 
image of a point in CP 2 is a copy of CP 1 and the restriction of the evaluation map to 
this copy is just a degree one map. This should be a degree two map. The reason for this 
failure is that each map in this moduli space has automorphism group Z2 . It appears 
that one should take a double cover of CP 2 x CP 1 branched along the divisor Di . The 
fiber of such a cover over a point in CP 2 would be a two-fold branched cover of CP 1 
branched over two points. This is also a copy of CP 1 , but the induced evaluation map 
would have degree two as it should. The only problem with this is that no such cover 
exists in the category of schemes - such a map would restrict to a two-fold connected 
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cover over a simply-connected space. The map 

M Q ,i(CP\2[CP 1 ]) Mj.oCCP'^tCP 1 ]) 
has exactly this structure in the category of stacks. 

To go further we have to use the power of stacks. Recall that one description of a 
Deligne-Mumford stack is a contravariant functor from SCHEME to SET. Any scheme 
produces such a functor. For CP 2 one gets the functor CP 2 taking a scheme S to 
Mor(5, CP 2 ) . To analyze the local symmetry groups we need to use the fibered category 
structure. The objects in the associated fibered category are ordered pairs consisting of 
an element of the set associated to a scheme and the scheme. The points in the stack are 
just those objects corresponding to the one point scheme. For example, 

[0 : 1 : 0] := ([0 : 1 : 0] : pt CP 2 , pt) G Ob(D^ 2 ) , 

has Hom([[0 : 1 : 0], [0 : 1 : 0]) = {id}, so the local automorphism group of a point in 
the stack associated to CP 2 is trivial as expected. 

Let M. := A^o,o(CP , 2[CP ]). The argument showing that the corresponding coarse 
moduli space is isomorphic to CP shows that the points of this stack correspond to the 
points of CP 2 . Consider the local automorphisms of a point of this stack. Let 

z 2 := (u([z : w]) = [z 2 : w 2 ] : CP 1 -> CP 1 , tt : CP 1 -► pt, pt) G Ob(D^) . 

We have Hom(z 2 ,z 2 ) = {id,rc}, where n([z : w]) = [— z : w]. A similar thing is true 
for every point in M , so every point in this stack has automorphism group Z 2 . 

Exercise 10.2 Do a similar analysis in Mq^CP 1 , 2[CP 1 ]) to show that the points in 
Di have automorphism group Z 2 and all others have trivial automorphism group. 

We can interpret the fundamental cycle of the stack M to be ^[AfyoCCP 1 , 2[CP 1 ])] 
or just \ [CP 2 ] G // 4 (CP 2 ; Q). The factor of 1 /2 here is due to the Z 2 automorphism 
group. To compute the virtual fundamental class, we need to compute 

c 2d+2g _ 2 (R 1 (7r*)(ev* pl (C(-l) © O(-l)))). 

In this case 2d + 2g — 2 = 2 , so the Whitney sum formula gives 

c 2 (RV*)(ev^ pl (0(-l) © O(-l)))) = Cl (R\7r*)( eV * Cpl (0(-\)))) 2 . 

Attempting to construct a two-fold cover of CP 2 x CP 1 branched along D\ provides 
good motivation for the computation of the above first Chern class. In general to 
construct a p -fold cyclic cover branched along a divisor D, one constructs the line 
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bundle associated to D, say Lp, and takes a section vanishing along D, say o~d- If L l / p 
is a pth root of this bundle in the sense that (L l /Pf>P = Lj) , the desired cover will be 

{ZeL 1 /p\e = *DWQY'}. 

Applying this idea to the divisor D\ in CP 2 x CP 1 , we cover CP 2 x CP 1 by charts 
V a , Vb, and V c corresponding to a = 1, b = 1 and c = 1. Define a bundle 
L l J 2 = C 3 x (C 2 — {0})/ ~ where (a,b,^y,zo,Zi) ~ (a, b, A7, Xzo, Xzi), and similar 
bundles over V a and . We have a section of the tensor square of this bundle taking 
(a,b, [zq : ZiJ) to [a,b,az^ + bzozi + z\-,Zq-,Z\\- The problem is that there is no 
reasonable way to glue these pieces into a global bundle. Continue anyway and define 

Q c := {[a,Z?,7,z ,Zi] G ^c /2 |7 2 = azl + bzoZi + zj} 

with natural projection map it : <2 C — > C 2 and v : Q c —> CP 1 given by 

v([a,b,j,zo,Zi]) = [zo : zi]- 

Exercise 10.3 Show that 7r is a flat morphism. 

We can see that Q c is a flat family of stable, genus zero, degree two maps to CP 1 . We 
just need to check that the restriction of v to the inverse image of any point in C 2 is 
such a stable map. For example, we have an isomorphism CP 1 — > vr _1 (l, 0) given by 
[s ! t\ 1 — ► [1,0, s 2 — t 2 ,2st, s 2 + f 2 ] . The composition of this map with the restriction of 
v is a degree two map with critical values at [±i : 1] as expected. 

Exercise 10.4 Identify the restriction of v to 7r _1 (0, 0). 

Stacks should be considered as generalizations of schemes that include orbifold 
information. Just as a manifold is defined via a maximal atlas while specific manifolds 
are usually described by a finite atlas, a specific stack can be described by a finite 
cover while the general definition adds a condition analogous to maximality. One can 
cover the stack MopiCP 1 , 2[CP : ]) by the family Q c together with two other analogous 
families denoted by Q a and Qt . 

Exercise 10.5 Construct analogous families Q a and Qb and use these three families 
to conclude that A^o,o(CP , 2[CP ]) is locally representable. 

We have been treating stacks as contravariant functors from the category of schemes 
to sets, so for example the stack associated to a scheme T is the functor that takes a 
scheme S to the set of morphisms from S to T. As explained in Appendix A, stacks can 
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also be viewed as fibered categories. The objects of the fibered category D- are just 
morphisms u : R — > T, similarly the objects of the fibered category version D M of the 
stack MofiiCP 1 , 2[CP 1 ]) are just families of stable maps [v : V — > CP 1 , 7r : V->R]. 

Notice that any flat family Q over a scheme T (really, [q : Q — > CP 1 , it : Q — > T] ) 
living in M^CP 1 , 2[CP 1 ])(r) defines a map of stacks (covariant functor) taking D- 
to D M . This map takes a morphism u : 7? — ► J in D- to the fiber product family 
[qopr v Q x T R^R] where Q x T R:= {(x,y) £Qx R\ir(x) = u(y)} inD M . 

Exercise 10.6 Define the action of the functor on a morphism from u : R — > T to 
v : 5 — > T, that is, w: /? — ► 5 that satisfies m = v o w. 

An alternate way to think of stacks is as a categorical construction of 
terminal objects. The moduli space should be a terminal object in the 
category of families of stable maps. The problem is that generally no 
such terminal object exists. When we stackify we replace a scheme by 
a contravariant functor and a morphism by a natural transformation and 
arrive at a category in which the collection of all families itself turns into a 
terminal object. 

We now turn to the computation of the Chern classes in the formula for the Gromov- 
Witten invariants from the last subsection. Recall that Chern classes satisfy c(f*E) = 
f*c(E), so we could compute a Chern class of a bundle by computing the Chern class 
of the pull-back under a finite branched cover and dividing by the degree of the cover. 
Since H 2 (CP 2 ;Z) ^ Z, to compute the first Chern class of any bundle over CP 2 it 
suffices to compute the first Chern class of the restriction of the bundle to CP 1 . Even 
though there is no bundle corresponding to the square root of the line bundle associated 
to D\ , there is an object corresponding to the result of taking the pull-back of the 
restriction to CP 1 of such a bundle under a two-fold cover. We apply this to the CP 1 at 
a = 0. 

Define 

Q := {[a,b,c,zo,Zi] G C x (C 2 - {0}) 2 | a 2 = b 2 z Zi + c 2 zoZi} / ~, 

with [a, b, c, zo, Zi] ~ [A/xcn, Xb, Ac, /j,zo, l^Zi] ■ This is the total space of a family with 
projection ir : Q — > CP 1 taking [a, b, c, zo, Zi] to [b : c] and evaluation v : Q — > CP 1 
given by v([a, b, c, zo, Zi]) ■= [zo : Zi] - Notice that the variables b and c here do not 
correspond to the same variables as used earlier. It might have been clearer to use b\ , 
c\ here with the relations b = b\ and c = c\ defining the two-fold branched cover 
of the CP 1 cycle. This would just complicate the notation a bit further. The space 
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<2 is also a flat family of genus zero, degree two stable maps. We wish to compute 
ci(RH7r*)(v*(e>(-l)))). Recall that R\tt*) is just a vector bundle over CP 1 with fibers 
isomorphic to H l (ir~ l (—), v\*(0(— 1)))). It helps to remember how to compute sheaf 
cohomology at this point. 

Exercise 10.7 Define L n := C 2 x (C - {0})/ ~, with (z ,Zi, ~ (Azo, Azi, A"0- 
Taken with the natural projection to CP 1 , this is a line bundle. Let 0(n) be the 
associated sheaf of sections. Using the standard zi / 0, zo / cover of CP 1 , compute 
the Cech cohomology groups H*{€.V l \0(n)) for various positive and negative values 
of n. 

To compute R 1 we use the following description of it from Hartshorne's book [75]. 

Theorem 10.8 If A is a sheaf over X andf: X -> Y, then R k (f*)(A) is the sheaf 
associated to the presheaf taking V to H k (f~ l (V); A\f-i( V) ). 

One can check that the v pull-back of the L_i bundle over CP 1 to Q is the bundle 
defined by 

v*L_! := {[(,a,b,c,zo,zi] G C 2 x (<C 2 - {0}) 2 | a 2 = b 2 Zo zi + c 2 z zi}/ ~, 

with [£,a,b,c,ZQ,zi] ~ C> Xb, Ac, //zo, fJ-Zi]- Now work over the c / chart 
U c of CP 1 . We can take an open cover of the ir -inverse image of this chart consisting of 
Zi / and zo ^ 0. Computing R 1 {-k*){0{v*L-\)){U c ) amounts to computing the Cech 
cohomology of the inverse image of U c . A 0-Cech cochain consists of an algebraic 
section of v*L_i over the zi 7^ chart and a section over the zo 7^ chart. A section 
over the first takes the form \f(a, b,zo), a, b, l,zo, 1] where f(a, b, zo) is polynomial 
and we are using the obvious coordinates obtained by setting c and z\ to one. A 
algebraic section over the zo / chart takes the form [g(a, b, zo),a,b,l,l, z\\ with g 
polynomial. Using the relation from the definition of Q we can eliminate all second 
and higher powers of a and write f(a, b, zo) = fo(b, Zo) + ce/i (b, zo) ■ The polynomial 
g(a, b,zi) can be expressed similarly. A Cech 1 -cochain is just a algebraic section 
on the overlap. The Cech coboundary is just the difference of the restrictions of the 
sections from the two large charts. In order to compute this difference we must write 
each section in the same coordinates. Using the zi / chart we can write the Cech 
coboundary as 

$(f,g) ■= [(fo(b,z,a) - Zq ^0(^,20 ! )) + a(fi(b,zo) - z 2 gi(b,z l )),a,b,l,Zo, l] • 

Notice that on the overlap zo 7^ and a 2 = bzo + 1 , so any algebraic function on 
this overlap may be written in the form ZQ N (Fo(b, zo) + F\(b, zo)a) , with polynomial 
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Fq and F\. Combining this with the expression for the coboundary implies that 
the cokernel of 5 is the C[b] -module generated by Zq l a and this is the definition 
of R l (ir*)(v*((D(-l)))({c / 0}). This confirms our theoretical arguments from the 
previous subsection that R 1 (7r*)(ev*C>(— 1)) is a locally free, finite-rank sheaf of Os- 
modules of the correct dimension. On the b / chart we will use c, zo (z.i ) and a 
with b = 1 as our coordinates. 

Exercise 10.9 Show that R 1 (vr*)(v*(C(-l)))({Z7 / 0}) is the C[c]-module generated 
by Zq l a in the given coordinates. 

We must be careful when making the identifications between the b / chart and 
the c / chart. It might help to use a different variable, say (3 in place of a when 
describing the b / chart. The correct way is to scale the b = 1 answer to a 
c = 1 answer using the equivalence from the definition with A = c~ 1 . It follows 
that the section of R 1 (ir*)(v* (0(— 1))) over the c / chart given by Zq l a extends to 
a meromorphic section over all of CP 1 given by c~ 1 z l a in the b ^ chart. This 
meromorphic section has exactly one simple pole, and no other poles or zeros. It follows 
that cKR^^Xv^OC-l^XtCP 1 ]) = -1. We conclude that 

cKRHvr.Xev^OC-DMtCP 1 ]) = -\. 

Putting everything together gives 

M),2= / cKRWev* 0(-l))) 2 

JiMoMCP'aicp 1 ])] 

= ci (R 1 (tt*) (ev^ pl 0{- 1 )) ) 2 ( \ [CP 2 ]) 

= ^^^^^^(ev^OC-l^aCP 1 ])) 2 = \. 

While this direct computation clarifies all of the ingredients in the definition of the 
Gromov-Witten invariants, it is not very practical for computing the general case. To 
compute the answer in the general case, we return to localization. 

11 The full multiple cover formula via localization 

In this section we apply localization to compute the Gromov-Witten invariants of the 
resolved conifold . We should really say that we are using virtual localization. This 
is a generalization of the localization formula that we have already explained in two 
different directions. First one must apply localization in the stack setting, and second 
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one must apply it with virtual fundamental classes. The correct generalization is given 
by Graber and Pandharipande [66]. 

The standard torus action on CP 1 extends to a torus action on all of the spaces involved 
in the multiple cover formula for the Gromov-Witten invariants of 0(— 1) © 0(— 1). 
In fact this action extends to the bundle 0{— 1) in several ways. Just as line bundles 
are classified by the first Chern class, equivariant line bundles are classified by the 
equivariant first Chern class. An action on a vector bundle compatible with an action on 
the base is called a linearization. Using the classification of equivariant bundles it is 
standard to label linearizations by the associated equivariant first Chern class. Thus we 
label the linearizations of 0{— 1) by nao + ma\ — h. 

Exercise 11.1 Construct group actions on 0(— 1) corresponding to the equivariant 
class nao + ma\ — h. 

The numbers that we need to compute are 



Based on the computation of N3 via localization in Section 7.3 one might worry that 
this localization computation is going to be very complicated. In fact, the computation 
is fairly straightforward provided that one chooses the proper linearizations. Faber and 
Pandharipande discovered that if one chooses ao — h as the linearization on the first 
factor of 0(— 1) and a\ — h on the second factor then only components of the fixed 
point set corresponding to one graph contribute to N g d [55]. 

To see why only one graph can contribute let [u,H,p] be a stable map fixed by the 
group action. We start with the normalization sequence 



We already used this sequence in computing the Euler class of the normal bundle 
associated to deformations of the map in Section 8.4. As before we use c to denote the 
nodes of £ . We have a related exact sequence for holomorphic sections of the pull-back 
bundle u*0{-\): 




-»■ O s -» -> ® c o c -> 0. 



-» u*0{-\) 



v*v*u*0{-\) -> ffi c L_i| H(c) -> 0. 



The associated long exact sequence on cohomology reads, 



H°(H, v*u*0(—Y)) -» ® n L-i|„(„) -+H l &,u*0(-1)) 
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There are bundles over the moduli space with fibers isomorphic to the spaces in this 
exact sequence. We will use the fibers as names for these bundles so for example, we 
will denote the bundle RV*)(ev* pl (C>(-l))) by H\T,,u*0(-\)). 

The normalization is a union of smooth components £ = \J an d the cohomol- 
ogy H°(%u*u*0(-l)) is isomorphic to ®H (E k ,u*u*O(-l)). The contribution 
u*u*0{— 1)) is trivial unless the component is contracted to a point under 
the stable map in which case it is isomorphic to C . Since the original prestable curve is 
connected we see that there is at least one node attached to each contracted component 
and there are extra nodes if there is any vertex in the graph associated to the stable curve 
with valence grater than one. 

It follows that the cokernel of the map 

H\t,v*u*0{-\)) ^ @cL-i\u ( c) 
is only trivial if there are no vertices of valence greater than one. 

In the case the cokernel is nontrivial the long exact sequence of bundles implies that 
e(R 1 (7r*)(ev^, pl (0(— 1)))) contains a factor of e(L_i| M ( c )). In order for the stable map 
to be equivariant u(c) must either equal qo = [1 : 0] or q\ . Without loss of generality 
it is equal to qo. Now consider the factor e(R 1 (7r !|< )(ev^ pl (0(— 1)))) corresponding to 
the ao — h linearization. The corresponding linearization of L_i |„( c ) is trivial, implying 
that e(L_i| M ( C )) is trivial, so such a stable map cannot contribute to the Gromov-Witten 
invariant. 

It follows that the only components of the fixed point set that contribute to the Gromov- 
Witten invariant correspond to graphs consisting of one straight edge. Consider such a 
graph with genus gk on the qu vertex and compute the Euler class in the integrand and 
the Euler class of the normal bundle to the fixed point set. 

Start with the Euler class in the integrand. We have seen that the cokernel of the map 
into the fibers over the nodes is trivial in this case. On the other hand, the higher 
cohomology groups of any skyscraper sheaf like L_i |„( c ) vanish. If follows from the 
long exact cohomology sequence that 

(16) H\Y,, u*0(-\)) ^ H\Y,, u*u*0(-\)) 

* H\^ go ;L^\ qo ) © H l (^ gl ;L^\ qi ) © H\CP U , u*0{-\)) . 

We now need to compute the equivariant Euler classes of these bundles. For this we need 
to describe the group actions on the relevant spaces. The degree d line bundle over CP 1 
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may be described as equivalence classes [zo, Z\ , £] where (azo, az\ , a d ^) ~ (zo, Z\ , 
for a G C — {0} . One can check that the action 

A • [zo,zi,£] = [A z ,Aizi, A^A" 1 ^] 
is the linearization labeled by dh — n^a^ — n\a\. This is the answer to Exercise 11.1. 

Exercise 11.2 Show that the action on the fiber over go := [1 : 0] is (d — n^a^ — n\a\. 
Use this to show that the natural linearization on TCP 1 is 2h — ao — a\ . Also show 
that the linearization on a tensor product of line bundles is the sum of the corresponding 
linearizations. 

Exercise 11.3 Recall that the holomorphic sections of the degree d line bundle 
over CP 1 are just degree d polynomials. Use this to show that the linearization 
dh — «oQo — n\a\ on 0(d) turns H°(CP , 0{d)) into the representation 

®l =Q [(k - n Q )a + {d-k- «i)ai] . 

Finally, recall that to get an action on the domain of the degree d map [zq : zi] *— ► [Zq : z\ ] 
one must pass to the Jth power action on the codomain. This is kept track of by dividing 
by d at the end. 

Kodaira-Serre duality implies that 

H\CP l ;u*0{-\))^H () {CV l ;0{d)®0{-2)) y , 

where the 0(d) inherits a linearization from 0(— 1) and 0(— 2) has the natural 
linearization arising as the cotangent bundle (dualizing sheaf) on CP 1 . Using the qo — h 
linearization on 0{— 1) the previous two exercises together with the above remarks give 
the Euler class of the last summand of equation (16) as 

d-2 

(17) e{H l {CP l ;u*0(-l))) = JJ[(c?-£- \)a /d + (k-d + l)m/d] 

k=Q ={d— \)\d l ~ d {ao — at\) d ~ l . 

Similarly, with the a\ — h linearization we obtain 

e{H\CV l ;u*0{-\))) = (-V) d ~ l (d - \)\d l - d {a - aif' 1 . 

Now Kodaira-Serre duality implies that 

H l (E go ;L-!\ qo ) ^//°(S, ; WEsn ) v ®L_i| 9o = E v ®L_i| 9o . 
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Recall that E is the Hodge bundle, which is by definition the bundle over the moduli 
space with fiber over a point isomorphic to the first cohomology of the curve representing 
the point with coefficients in the dualizing sheaf . By Exercise 8.8 we conclude that 

go 

(18) e(H l (Z go ;L^\ yo )) = ^ c,-(E , 

!=0 

where t is the linearization (equivariant first Chern class) of L_i| ?0 . When the 
linearization on L_i is ao — h we have t = 0; when the linearization is ot\ — h we 
have t = a.\ — ao- 

Now turn to the equivariant Euler class of the normal bundle to the fixed point set. 
Consider a component of the fixed point set with < go < g. Formula (8) implies that 
e(aut([E,p]) mov ) = 1, while formula (9) gives 

e(dtm,p]) mov ) = ((a - ai)/d - Vo) ((ai - a )/d - • 

Similarly formula (11) gives 

J]e(r M(c) CP 2 )) = -(a -a 1 ) 2 , 

c 

formula (12) gives 

e(H°{% 0(u*TCP 2 )) mov ) = -(-\) d (d\fd- 2d (a - a Y ) 2d+2 , 
and formula (13) gives 

e(H\t; 0(i/VrCP 2 )) mov ) = 

go \ / Si 



so \ /si \ 

cKE v )(ao - ai) ? °- ! ' J I ^ c,(E v )(ai - ao) 81 ' 1 J . 



i=0 y v i=0 

Localization also works on stacks with virtual fundamental cycles (see Graber and 
Pandharipande [66]). The factor e(ob([u, S,/?]) mov ) does not need to be computed 
because it is accounted for in the virtual fundamental cycle. Automorphisms of generic 
elements of the fixed point components do have to be taken into account. The virtual 
localization formula reads (compare to (6) and (7)): 



1 f t* F 4> 



F 



\K F \ 7 F v„- e(N(F)™) 



Combining all of the above formulas together with (14) and (10) shows that the 
contribution to the Gromov-Witten invariant coming from the go,gi fixed point 
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component is 

^ 1 (-l) d (rf!)- 2 d M (a -air M (rf-l)W _ Hao-Q!i) d "H-l) d " 1 W-l)!^ 1 (Q!o-Q;i) d " 1 

j. go go 

Vc / (E v )(ao-ai) M - 1 c a (E v )V C ,-(E v )(ai-ao) M - , '((ao-ai)/fl'-V'o)" 1 

/_ VcKE v )(ao-a I Fi- 1 Cjl (E v )Vc ! (E v )(a 1 -ao) g '" , '((ai-a )/ a '-V'i)" 1 • 

Using the fact that ct{E y ) = (— l)'c,(£) in general together with the relation 
c(E)c(E v ) = 1 proved by Mumford for the Hodge bundle [116] this contribution 
can be simplified to 

d 2g - 3 I c g0 (E)^°- 2 I c gl (E)^~ 2 . 

Exercise 11.4 Prove that the same formula is valid when either go or g\ is zero 
provided Jj-^j c g0 (K)^ 2g0 ~ 2 is interpreted to be one when go = 0. 

Expressions such as 

b g0 ■= [_ c g0 (E)^- 2 

are called Hodge integrals. The classes c^(E) are called Hodge classes and are often 
denoted by . 

If a different pair of linearizations was chosen for the two 0(— 1) factors, one would 
obtain a very different looking expression for this Gromov-Witten invariant. This 
generates relations between the various Hodge integrals that can be used together with 
similar relations coming from an integral with a 0(1) factor to compute the Hodge 
integrals. This is the approach taken by Faber and Pandharipande [55]. The result 
obtained there is 

s/2 



Ev 28 



% sin(j/2) 
g =o ^ v i ' 

Relations in the cohomology of moduli space allow one to express the cubic Hodge 

integral from Exercise 9.13 in terms of the above Hodge integrals. The answer obtained 

in [55] for g > 2 is 

(l-2g)B 2g B 2g _ 2 
[M sfi] * 8 (2g-2)(2g)! • 

This can be combined with Exercise 9.13 to obtain the following formula for the degree 
zero invariants of Calabi-Yau 3 -folds: 

(-iy-\2g-l)B 2g B 2g _ 2X (X) 



(19) N g n(X) 



2(2g-2)(2g)l 
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12 The Gromov-Witten free energy 

In this final section on the Gromov-Witten invariants, we cover a way to package the 
Gromov-Witten invariants in case all of the unmarked moduli spaces have zero virtual 
dimension. For a Calabi-Yau 3 -fold, this assumption is true. This implies that the 
most interesting invariants are of the form 



These are combined in the following definition. 

Definition 12.1 The Gromov-Witten free energy of a Calabi-Yau 3-fold X is the 
following formal function depending on a complex parameter y and a cohomology 
class t £ H 2 (X;C). 



g=0 (3 

The restricted Gromov-Witten free energy is the sum taken over all non-zero homology 
classes (3. 

It might be interesting to formulate a free energy for more general symplectic manifolds. 
The Gromov-Witten invariants of as originally computed in the previous article are 
given by 



go+gi=g 

where b g are the Hodge integrals computed by Faber and Pandharipande presented in 
the form of the generating function 



Exercise 12.2 Combine the last three displayed formulas to prove that the restricted 
Gromov-Witten free energy of X S 3 is 




oo 
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It is the full Gromov-Witten free energy that appears in the gauge-string duality. The 
full Gromov-Witten free energy is simply the sum of the restricted free energy with 
the degree zero invariant computed earlier. After we give an overview of path integral 
techniques we provide a heuristic argument for combining the Gromov-Witten invariants 
into this generating function. This is contained at the end of Section 15. 

We can think of t as a complex parameter if we identify H 2 (Xg) ; C) with C via evaluation 
on [CP 1 ] . The Gopakumar-Vafa integrality conjecture states that the Gromov-Witten 
invariants are uniquely specified by a set of integer invariants called Gopakumar-Vafa 
invariants (or BPS states) [65]. These BPS states are denoted by n 8 ^ and are supposed 
to be a count of embedded genus g, /-holomorphic curves in the homology class (5. 
The Gopakumar-Vafa integrality conjecture takes the exact form 



Clearly X S i satisfies the Gopakumar-Vafa conjecture with «? cp i, = 1 and the rest of 
the BPS states equal to zero. 
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Part II 

Witten-Chern-Simons theory 

There is a vast amount of literature on Witten's (quantum) Chern-Simons theory, 
conformal field theory, quantum groups and so on. We will try to outline the basic 
notions and definitions that are needed to address Large N Duality. The first subsection 
below reviews background material about surgery and 3 -manifolds. The easiest 
definition of Witten's Chern-Simons invariants is based on skein theory (Appendix C). 
It is however, difficult to to compute the resulting invariants or to show that they are 
well-defined. For this reason we will work with the definition based on quantum 
group. Section 15 describes the physical motivation for these invariants. Path integral 
motivation leads to perturbative invariants. Quantum field theory motivation leads to 
the Reshetikhin-Turaev (exact) invariants. Definitions of the Chern-Simons partition 
function and free energy based on the latter are given in Sections 16.7, 18.1 and 19.2 
after several sections on background and motivation. 

13 Framed links and 3 -manifolds 

Chern-Simons theory provides topological invariants of 3 -manifolds and framed links 
in 3-manifolds. There is a close connection between framed links and 3-manifolds, 
namely any closed oriented 3 -manifold may be obtained by surgery on a framed link in 
the 3 -sphere. 

Definition 13.1 A framed link is an embedding of a finite disjoint union of copies of 
S 1 x D 2 into a 3 -manifold. Two framed links are considered equivalent if they are 
related by an ambient isotopy. 

Any ambient isotopy can be decomposed into elementary isotopies called Reidemeister 
moves. Two framed links are isotopic if and only if they are related by a finite sequence 
of Reidemeister moves. Framed links in M 3 can be cut into elementary pieces called 
tangles (see Figure 16.2). It is often easier to analyze elementary tangles. 

Definition 13.2 A framed (or ribbon) tangle is an embedding of a finite disjoint union 
of copies of S l xD 2 and [0, 1] xD 2 into [0, l]xl 2 taking {0, 1} xD 2 into {0, 1}xE 2 . 
The embedding of {0, 1} x D 2 into {0, 1} x R 2 must be standard, depending on the 
number of components landing on each boundary component so that ribbon tangles 
may be stacked. Tangles are also considered up to isotopy. 
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Figure 13.1: Reidemeister moves II and III 




Figure 13.2: Framed link projection and ribbon link 



In order to move everything into closed 3 -manifolds it is standard to work with the 
one point compactification of M 3 . This is homeomorphic to the 3-sphere. By general 
position we may assume that any framed link in S 3 misses the point at infinity. All 
framed links in S 3 can therefore be brought back to IR 3 and represented by a projection 
of the cores S l x {0} to a plane keeping track of over-crossings and under-crossings. 

To recover a framed link from the projection one first pushes the over-crossings slightly 
above the plane to obtain an embedding 7 : ]J S 1 M 3 . This is then extended to 
an embedding 7: [J S 1 x D 2 ^ I 3 by %t,x,y) := 7(0 + kx + 7(f) x ky. This 
convention is called the blackboard framing. The same technique may be used to 
represent ribbon tangles. Figure 13.2 displays the projection of a framed link and the 
image of S l x ({0} x [0, 1]). This second picture justifies the name 'ribbon'. 

Framed links in S 3 can be used to construct more complicated 3 -manifolds by a process 
called surgery. 

Definition 13.3 Surgery on a framed link 7 refers to removing the image of the 
S l x B 2 's and attaching the same number of S 1 xD 2 's in such a way as to glue { 1 } x S 1 
to 7(5 J x{(0,l)}). 

If you have never seen surgery before, the book by Rolfsen is a good reference [131]. 
Other good references that are relevant to this exposition are Prasolov-Sossinsky [125] 
and Kassel [83]. 
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By a theorem of Lickorish and Wallace any closed oriented 3 -manifold can be obtained 
by surgery on a framed link in S 3 [131]. It is not difficult to see that surgery on 
isotopic framed links produces homeomorphic 3-manifolds. What is less obvious but 
still not difficult is that surgery on two framed links related by an additional move 
called the Kirby move (Figure 13.3) still produces homeomorphic 3-manifolds. In 
fact, Kirby proved that two framed links represent the same 3 -manifold if and only if 
they are related by a sequence of Reidemeister and Kirby moves [85, 131]. Actually, 
what we are calling the Kirby move was introduced by Fenn and Rourke [56]. Kirby 
himself used an equivalent pair of moves: blow up/down and handle slide [85]. Blow 
up/down adds or removes an unlinked, (±l)-framed circle and handle slide tubes 
one component of a link to a parallel copy of a second component. The names come 
from descriptions of 3-manifolds as boundaries of 4-manifolds and the corresponding 
moves on 4-manifolds. 

Exercise 13.4 Show that surgery on the framed link with projection a simple circle in 
the plane gives S 2 x S 1 . 



A framed link in an arbitrary closed oriented 3-manifold can thus be represented by a 
pair of disjoint framed links in S 3 . One of them represents a surgery description of the 
3-manifold and the other represents the framed link in it. It follows that to define an 
invariant of 3-manifolds or framed links in 3-manifolds it suffices to define it on a pair 
of framed links. Of course one then has to prove that this link invariant is also invariant 
under the Kirby moves. There are many invariants of links and framed links, some are 
described in Appendix C. Most of them are not invariant under the Kirby moves and 
therefore do not produce invariants of 3-manifolds. But Chern-Simons theory does 
generate 3-manifold invariants along these lines. 
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14 Physical and heuristic descriptions 

The definition of the Chern-Simons invariants of 3 -manifolds was motivated by quantum 
field theory (QFT) (see Deligne et al [44] and Zee [162]). We can afford to be sketchy 
because this will only serve as motivation for the mathematically rigorous definition 
of the Reshetikhin-Turaev invariants that we use later. The mathematical foundations 
of quantum field theory are not completely developed, but the existing machinery and 
conjectural structure of QFT has produced and motivated many remarkable mathematical 
theorems. The invariants that we are discussing in this section are the expectation values 
of observables of the QFT associated to the Chern-Simons action. These values do not 
depend on any additional geometric structure and are therefore topological invariants of 
the underlying 3 -manifold space-time. Such QFT's are called topological quantum 
field theories TQFT's. 

First recall the Lagrangian approach to quantum mechanics. In this approach the 
partition function is equal to the integral of the exponential of the action that is, 
Z = f e' h s . For the quantum mechanics of a classical particle under the influence 
of a conservative force with potential V the action is S = f ^mx(t) 2 — V(x(t))dt. The 
classical equations of motion are just the stationary curves of this functional. It is 
exactly these stationary points that contribute to highest order in the stationary phase 
approximation of Z. This generalizes in an obvious way for extra degrees of freedom. 
The same framework generalizes to continuum mechanics and field theory. 

The fields in Chem-Simons theory are connections. A connection can be viewed as 
a Lie algebra valued 1-form on the 3 -manifold. Think of the sum A = AjO' where 
the matrices A, live in the Lie algebra and the 6' are 1 -forms. If R is a representation 
of a Lie algebra we define a trace function on the algebra by Tr#(A) = Tr(7?(A)). The 
Chern-Simons action of a U(N) connection is given by 



where □ is the defining representation of U(N). Witten suggested the simple idea that 
the average of a function of this Chern-Simons action taken over all connections should 
be a topological invariant. Indeed, nothing in the definition of CS depends on any 
geometric data [159] except the connection which is integrated out. This average is 
called the Chern-Simons partition function. It is formally written as 



J A 

where VA is an as of yet undefined measure on the space of connections and x is the 





so-called string coupling constant. In the path integral expression one takes x 



2tt 
k 
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where k is a positive integer called the level. It turns out that after performing formal 
perturbative expansion in x one needs to 'renormalize' it to x = , where N is 
the rank of U(N) to get the 'correct' answer. The explanation for this shift is not 
fully understood mathematically and underscores subtleties of infinite-dimensional 
integration. This shift comes from the interpretation of the signature of an operator as 
an eta invariant, see Witten [159] and Atiyah [14]. Because of this some authors call the 
level k and others will call it k + N, so one must be careful when comparing different 
results in the literature. We call the level k. 

This formal representation of the partition function is an example of a Feynman path 
integral (see Etingof [54]). This invariant or various normalizations of it is more often 
denoted by r(M) in the mathematical literature. The problem is that this 'average' 
is not well-defined because the space of connections is infinite-dimensional and a 
translation-invariant measure does not exist. However, there are ways to formally define 
invariants that have most of the properties expected of this 'average'. It turns out however 
that they do depend on an additional geometric structure on a 3-manifold known as 2- 
framing (trivialization of T(TM) up to homotopy) [14, 159]. This phenomenon is called 
gravitational anomaly by physicists and is sometimes explained by the 'measure' DA not 
being purely topological [162]. Gravitation in physics is represented by a background 
metric and a metric in its turn determines many additional structures including a 
2-framing. Realistic quantum field theories such as quantum chromodynamics do 
depend on metric or in physical terms, are coupled to gravity. In (almost) topological 
quantum Chern-Simons theory framing dependence can be seen as a lingering ghost of 
this metric dependence. 

This being said, 2-framing is a very weak structure, so weak in fact that every 3- 
manifold admits a canonical one. Using it one can normalize the partition function so as 
to cancel out the framing dependence altogether. This is exactly the Reshetikhin-Turaev 
normalization of invariants that we adopt in Definitions 16.35, 19.19. It differs from 
the physical normalization used by Witten [159] and this has implications for Large ,/V 
Duality. For instance, Ooguri and Vafa [122] find a much better agreement between the 
gauge and string partition functions than we do. Unfortunately, there is no consistent 
definition of the 'physical normalization'. In examples it is usually derived ad hoc by 
comparing the exact answers to perturbative expansions, see eg Rozansky [133]. 

One can also 'average' holonomies around colored framed links. A colored framed 
link is a framed link with a group representation associated to each component. 
The Chern-Simons invariants of colored framed links are the expectation values of 
observables constructed from the holonomy of connections. In physical language 
these observables are called Wilson loop operators. The holonomy is given by 
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HoU(7) := P exp <f A. More explicitly this means that one solves the system of ODE's 
given by iX(t) + A(j(f)) = with initial data X(0) = I. Given this the holonomy is 
given by HoU(7) := X(l). Here we are assuming that 7(0) = 7(1). 

A Wilson loop operator for one component is the trace of the holonomy to a given 
connection along that component in a given representation. That is W^(A) := 
Ttr(Ho\a(K)) . The link invariant associated to these Wilson loop operators is just the 
vacuum expectation value (vev) or correlation function: 

The invariants defined mathematically based on this motivation (up to various different 
normalizations) are called colored Jones polynomials for SU(2) and the colored THOM- 
FLYP polynomials for SU(A0 . They are sometimes also denoted by J(L, R\ , . . . , R c ) 
for M = S 3 or by t(M, L) in general. 

Remark 14.1 This Wr u ,,^r c (L) is an invariant of oriented framed links, as changing 
the orientation inverts the holonomy. 

After a physical construction of invariants, Witten went further and outlined ideas 
that led to one way of making these invariants mathematically rigorous. He argued 
using skein relations that the expectation values of Wilson loop operators are given 
by the Jones polynomial of the corresponding links. Moreover, he gave an explicit 
prescription for computing Chern-Simons partition functions of 3 -manifolds based 
on their link surgery presentation, see Witten [159] and Axelrod-Della Pietra-Witten 
[19]. This was an amazing insight, but Witten's surgery prescription is a long way from 
a mathematically rigorous definition of the invariants. Reshetikhin and Turaev were 
first to devise a rigorous definition based on quantum groups [129, 130]. This is the 
definition that we will ultimately use. 

There are two philosophically different ways to interpret the expressions for these 
invariants: the perturbative approach and the TQFT approach. Each of these approaches 
can be formalized in different ways. The perturbative approach is outlined in Section 15 
and the exact approach is outlined in Section 16. Witten's original idea is also explained 
in the book of Atiyah [14]. 

15 Perturbative Chern-Simons theory 

Here we outline the perturbative approach to Chern-Simons theory because it motivates 
many of the definitions and conjectures that appear later. Numerous other authors 
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have written expositions on perturbative expansions (see Sawon [137] in this volume, 
Bar-Natan [23] and Polyak [124]). We include an overview here because it helps 
motivate Large N Duality. The perturbative approach is a generalization of two ideas 
for finite-dimensional integrals: the stationary phase approximation for oscillatory 
integrals, and a graphical calculus due to Feynman for evaluating Gaussian integrals 
(see Etingof [54]). 

Recall the stationary phase expansion, 



A theorem of Duistermaat and Heckman asserts that this is exact (with no 0{\~"l 2 ~ l ) 
term) when M is a symplectic manifold and H is an invariant Hamiltonian with only 
non-degenerate critical points [52]. One nice proof of this is based on the localization 
formula discussed in the section on Gromov-Witten invariants. In fact, the localization 
formula was originally discovered in an attempt to better understand why the stationary 
phase approximation was exact (see Atiyah and Bott [15]). 

Similarly, there is an infinite-dimensional symplectic structure on the space of connec- 
tions and the Chern-Simons action is invariant under the action of the gauge group. 
So one might expect that the stationary phase approximation is exact in this setting. 
The critical points of the Chern-Simons action are flat connections (see Baez and 
Muniain [20]) and one can define a perturbative expansion about these flat connections 
by analogy to the stationary phase approximation (see Bar-Natan[23, 22]). For the 
unknot in S 3 the agreement between perturbative and exact invariants has been verified, 
see Bar-Natan-Garoufalidis-Rozansky-Thurston [25]. There are still interesting open 
questions related to the appropriate interpretation of the full expansion on nontrivial 
manifolds since such manifolds admit nontrivial flat connections as critical points for 
the perturbative expansion. 

To better understand the structure of the perturbative expansion we consider a finite- 
dimensional Gaussian integral analog. 

Exercise 15.1 Recall that /(a) = J™ e~ a%1 dx may be evaluated by squaring it and 
then converting to polar coordinates. By taking successive derivatives of f(a) evaluate 



The expressions in the previous exercise become more complicated as n grows and 
are even more complicated for integrals over higher-dimensional Euclidean spaces. 
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Feynman added some slick book-keeping machinery to produce an efficient method for 
computing higher-dimensional Gaussian integrals. These integrals are analogous to the 
path integrals that arise in Quantum Field Theory. 

For finite-dimensional integrals the method is as follows. Let Q be a symmetric bilinear 
form on W and let V be a trilinear form on W . There is an obvious analogy between 
the following Gaussian integral, 

e -h-\Q(x,x)/2+V(,x,xjc)/€) d n x 

and the path integral formally defining the Chern-Simons partition function, 

Z k (M)= [ e^(/M Tr n(^A)+/ M Tr n (|AAAAA)) pA _ 
J A 

Substituting x = y/hy and expanding the second exponential gives 

fti f e -Q(y,y)/2 . e -VhV(y,y,y)/6 d n y 

1 

h( _ () .,.., 6 2 -(2m)!' 

the odd-order terms are missing from the above expression since their integrals evaluate 
to 0. A typical term in this sum may be evaluated by the trick described in Exercise 15.1 
by diagonalizing the quadratic form Q or generalizing the trick to higher-dimensional 
Gaussian integrals (see Sawon [137] and Etingof [54]). The result has the form 

_« (27T)3 h m v-^ 

where a represents a partition of the set { 1 , . . . , 6m} into two-element subsets encoded 
as a permutation on 1 , . . . , 6m and the W a have the form 

n n 2m— 1 3m— 1 

(21) /^j ' ' ' ^! 1J V(. e hk+\i e hk+2i e hk+?) \\ Q ( e 7<r(2<r+l)' e j r y(2k+2))- 

71=1 j 6m =l k=0 k=0 
with ei,...,e n being the standard basis in W. For example, when m = 1 



oo . 

t$Y, / e^y^^^i-VhV^y^d-y, 

r~n Jr." 



n n 



(22) ^2---^2V(e h ,e h ,e h )V(e i4 ,ei 5 ,e i6 )Q \e h ,e h )Q \e h ,e k )Q l (e k ,e is ) 

(1 = 1 ( 6 =1 

is a typical term. Terms with Q^ l (ej l , eQ and Q~ l {ei 2 , e^) are not distinguished, so in 
total we get (j^yr^ 6 "^) = (3my^ im summan ds. This is cumbersome. Fortunately, there 
is a way due to Feynman to represent such terms diagrammatically. 
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We construct a trivalent graph T for each term W a with a vertex for each V in (21) 
and an edge for each Q . The edge labeled Q~ 1 (e a , eb) will connect to the vertex or 
vertices containing a or b . The graph corresponding to example (22) is then represented 
by the graph (Feynman diagram) in Figure 15.1. It is easy to see that different summands 
in (20) give the same contributions as long as they have isomorphic graphs. So instead 
of summing over all partitions a as in (20) we may sum over graphs as long as we factor 
in the number of different partitions associated with each graph properly. Counting the 
number of partitions associated to a given trivalent graph is an elementary combinatorial 
problem with the answer (see Etingof [54]) 

6 2m (2m)! 

(23) # partitions associated to V = - — —, 

|Aut(r)| 

where Aut(T) is the automorphism group of the graph. Here we view a graph as a 
one-dimensional complex and an automorphism must restrict to a linear map on each 
edge. 



Example 15.2 There are 15 partitions of 1 , . . . , 6 into two-element sets, so the sum in 
(20) for m = 1 would have 15 terms. However, there are exactly two trivalent graphs 
with 2m = 2 vertices (see Figures 15.1 and 15.2) so the corresponding sum over graphs 
would only have two terms. The automorphism groups of the graphs in Figures 15.1 
and 15.2 have orders 8 and 12 respectively. 




Figure 15.1: Glasses 




Figure 15.2: Theta 
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Notice that 

X(T) := # vertices(r)— # edges(T) = 2m — 3m = —m. 
Thus the partition function Z can be rewritten as 

<24) z -k^Sisi5ii wm - 

where the sum is taken over the set of all trivalent graphs (including disconnected ones) 
3 and W(T) is the contribution or Feynman amplitude W a of any partition with graph 

r. 

The sum 

taken over the set of all connected trivalent graphs S' is called the^ree energy. 

Exercise 15.3 Show that if represents the disjoint union of T\ and T2 then one 
has 

Wir 2 ) = wFdWFd, 
x(r 1 r 2 ) = x(ri) + x(r 2 ) 



Aut(r;'...r^)l = ni Aut ( r ')l"'^) ! 



i=i 

Use exponentiation and series expansion to conclude that F = ln(Z/Zo) where 
Z = (2irh)"/ 2 /(detQ) 1 / 2 . 

We conclude that it will be helpful to consider the natural logarithm of the exact 
Chern-Simons invariants. 

Of course the notions of partition function and free energy were introduced in the field 
of statistical thermodynamics before they ever appeared in Quantum Field Theory (see 
Schrodinger [138]). In statistical mechanics, the probability that a state at energy E 
in a given system will be occupied is given by e~ E '^ kT \ The partition function in this 

r — — 

context is defined to be the following integral over phase space: Z = } t *q e n <ivol, 
where H is the Hamiltonian. In this context the free energy is defined by F = —kT\nZ. 

The finite-dimensional analogy may be taken further and used to motivate definitions of 
similar invariants of 3 -manifolds and framed links in 3 -manifolds from Chern-Simons 
theory. The common feature of all of these invariants is that they are expressed as 
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sums over graphs analogous to equation (24). Thus it makes sense to introduce the free 
algebra generated by all trivalent graphs with multiplication being disjoint union as in 
Exercise 15.3. Two different combinations of graphs can have the same contribution so 
we introduce relations in the algebra to identify such combinations. 

Example 15.4 One such relation is the IHX relation displayed in Figure 15.3. The 
meaning of this relation is that the contribution of any graph containing the piece on the 
left can be replaced by the contribution of the difference of the two graphs containing 
the pieces on the right. As Figure 15.3 shows this relation implies that the contribution 
of the 'glasses' graph is trivial. 




Figure 15.3: The IHX relation 

It is natural to consider the quotient algebra by these relations to reduce the expression 
for the invariants even further. In fact this sum can be taken over a basis for the quotient 
algebra. The resulting expression for the free energy takes the form 

(25) F(M) = h~ x(T) W G (T) FA(M, T), 

where W G is a homomorphism from the algebra of graphs to the complex numbers 
called a weight system, B is a basis for the algebra of graphs and FA(M, T) is the 
Feynman amplitude associated to the graph and the 3 -manifold. 

We are particularly interested in the weight system for U(N) (see Bar-Natan [22]). This 
weight system applied to a graph can be computed as a sum over labelings. Given 
a graph T label each of the vertices with or 1 , then fatten the graph according to 
the rules in Figure 15.4. The graph then turns into what is called a 'fat graph' which 



Qeometry & Topology Monographs 8 (2006) 



Introduction to the Gopakumar-Vafa Large N Duality 



313 




Figure 15.4: Fat Graphs 



topologically represents a Riemann surface with boundary. Let g be the genus of the 
surface and h the number of boundary components. Also, let £ denote a labeling of the 
graph, r^) the labeled graph, A(q its fattened version, and \t\ the sum of all labels in 
l\ then 

(26) W um (T) = J2(-V W N h(A w\ 

i 




Figure 15.5: Fattheta 



Example 15.5 To compute the U(A) weight of the 'theta' graph consider the fat 
graphs corresponding to the labels (0,0) and (0, 1) displayed in Figure 15.5. We 
see that the (0, 0) labeled graph has genus zero and three boundary components, and 
the (0, 1) labeled graph has genus one and one boundary component. The (1,1) 
and (1,0) labeled graphs can be constructed similarly. Thus, for the 'theta' graph 
W U{N \T) = N 3 — N - N + N 3 = 2N 3 — 2N. 

Since the Euler characteristic is a homotopy invariant, 2g—2+h = — x(A^) = — x(T) for 
any labeling. Combining equations (25) and (26) one sees that the U(N) Chern-Simons 
free energy has the form 

oo oo 
g=0 h=\ 

Here x is playing the role of h in our finite-dimensional Gaussian integral. The free 
energy takes the form of the free energy of an open string theory. One expects that 
there is such a theory of 'instantons at infinity' (degenerate curves) but there is no 
mathematical definition of this theory. 
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The fat graphs lead to closed surfaces as explained/conjectured by 't Hooft and these in 
turn lead to the 7-holomorphic curves that are counted on the Gromov-Witten side. See 
Ooguri-Vafa [122] for this idea applied to Chern-Simons theory, 't Hooft suggested to 
'sum over all holes' in this sum to obtain a 'closed string' expansion [76]. This means 
introducing a new parameter t = xN and combining all summands with like powers of 
h. Denoting 

oo 

F g (M) = Y J t h F g , h (M), 
h=\ 

we obtain 

oo 

F£ S = XVs- 2 F g (M). 

g=o 

This expression for the Chern-Simons free energy has the structure of 
the free energy of a closed string theory and is one reason to believe that 
there may be some relationship between string theory and Chern-Simons 
invariants. 

We explain the structure of the free energy of a closed string theory in more detail at 
the end of this section. On the string theory side, the 'instantons at infinity' live in the 
cotangent bundle to the 3 -manifold and are open strings. The cotangent bundle then 
undergoes a geometric transition where the boundaries of the open strings are collapsed 
to points giving closed strings on the manifold on the other side of the transition. The 
manifold across the transition from the cotangent bundle to S 3 is the resolved conifold. 
Thus, one expects that the Chern-Simons free energy is the same as the free energy of a 
closed string theory on the resolved conifold. There is a mathematically defined closed 
string theory on the resolved conifold, namely the Gromov-Witten theory. Identifying 
it as the correct dual theory completes the physical derivation of the duality and was the 
major contribution of Gopakumar and Vafa [65], see also Ooguri-Vafa [122]. 

To summarize, the first step is to describe the Chern-Simons invariants via 
fat graphs that should be considered as open strings. The second step is to 
sum over the 'holes' via a geometric transition to obtain a closed string 
theory. 

If the expression for F^f is rewritten using jV as a parameter in place of x = tN~ l one 
obtains a (1 /TV) -expansion and the g = terms will dominate for large N. When 
the gauge-string duality holds for the leading terms (genus zero contributions) it is 
said to hold in the large N limit. This was the case originally studied by 't Hooft. 
Witten realized that the duality would be exact without considering the large N limit for 
topological (metric independent) field theories, see Witten [161]. 
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The physical ideas here have been encoded into various mathematically defined 
perturbative Chern-Simons invariants [22]. Kontsevich defined a universal Vassiliev 
invariant for links taking the form of a rational linear combination of trivalent graphs 
in an algebra generated by trivalent graphs with a few simple relations, [91, 89]. See 
Bar-Natan [24] for a good time reading about these invariants and see [22] by the same 
author for a more typical overview. Schematically this invariant takes the form 

Z(L) = 

oo 

where (zi ,z[), - ■ ■ , (z m , z' m ) are m pairs of points on the link, and 

T((zi,z'i),...,(Zm,z m y,L) 

is the chord diagram representing the locations of these points on the circles in L. 
Furthermore, E((zi,z'[), ■ ■ ■ ,(z m ,z' m );L) is some expression computed as an integral 
or via intersection theory from the link L and pairs of the points. To get numerical 
invariants one just applies an algebra homomorphism from the algebra of chord diagrams 
to the complex numbers. Such homomorphisms are called weight systems [22]. 

There is a similar set of 3 -manifold invariants. A universal 3 -manifold invariant of 
this type (now called the LMO invariant) was introduced by Le, Murakami and Ohtsuki 
[94], and expressed like the Feynman expansion of a Gaussian integral in [26] by 
Bar-Natan, Garoufalidis, Rozansky and Thurston. As in the case of links, the universal 
3 -manifold invariant is a weighted sum of graphs. This invariant is an element in an 
algebra obtained as a quotient of the algebra freely generated by all trivalent graphs. 

It is interesting to see how string theory considerations suggest that the Gromov-Witten 
free energy will take the form F GW = J2^gy 2s ~ 2 ■ The action for a simple model of a 
vibrating string is 

5 = - j j u tt — u ss ds dt . 

Notice the close similarity between this action and the Dirichlet functional 

D = — j j u tt + u ss ds dt . 

As we saw in Exercise 9.3, the minima of the Dirichlet functional are exactly the 
7-holomorphic maps. One common feature of all string theories is the existence of an 
internal degree of freedom s in addition to the time t appearing in the action. This means 
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that one must consider collections of surfaces in string theory where one considered 
paths in ordinary field theory. 

To discretize a path one just subdivides the interval. To develop a discrete model 
of surfaces it is natural to triangulate the surfaces. Notice that the dual graph of a 
triangulation is a 3-valent graph similar to the Feynman diagrams encountered in path 
integrals. In fact neighborhoods of these dual graphs are the 'fat graphs' that we just 
discussed above. One can turn the process that we used in this subsection backwards 
and write out a partition function that would have these 'fat graphs' in its perturbative 
expansion. The most obvious candidate is the matrix integral 



where the integral is taken over the space of all N x N Hermitian matrices. 

When one performs a perturbative expansion on this partition function the important 
things to notice about each term in the expansion are 

(1) Each vertex contributes a factor of wN . 

(2) Each edge corresponds to a propagator and contributes a factor of N~ 1 . 

(3) Each face (of the dual complex to the triangulation) contributes a factor of N (for 
the sum of the indices). 

It follows that each term in the expansion has order w v N v ~ E+F = w v N 2 ~ 2g . Thus the 
free energy is also a sum of terms of order w v N v ~ E+F = w v N 2 ~ 2g because it is just 
the sum of the contributions from the connected graphs. It follows that the free energy 
can be written as 



Let us summarize the main conclusions of this section. It is natural to package Chern- 
Simons perturbative invariants into a formal partition function, then take the natural 
logarithm, introduce a new variable t = xN and expand into a power series in x. 
Furthermore, this function called the free energy will take the form F = F g x 2g ~ 2 
identical to the form of the free energy in any closed string theory. 

In the next section we review physical motivations behind Topological Quantum 
Field Theory (TQFT) and describe in detail a simplified version of it that can be 
nicely packaged into the language of ribbon and modular categories invented by N 
Reshetikhin and V Turaev. For a particular choice of categories coming from the theory 
of quantum groups these constructions produce the celebrated Reshetikhin-Turaev or 
quantum invariants of framed links and 3 -manifolds that can be seen as a mathematical 
formalization of Witten's Chern-Simons path integrals. 




F = T, FgN '' 



8 
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16 Modular categories and topological invariants 

Chern-Simons theory is a special type of quantum field theory called a topological 
quantum field theory (TQFT). Just as classical mechanics may be described in the 
Lagrangian or Hamiltonian frameworks any quantum field theory may be described 
in these two frameworks. The Lagrangian framework leads to path integrals and the 
Hamiltonian approach leads to canonical quantization (see Simms and Woodhouse [143]). 
We have already discussed the Lagrangian approach and the resulting perturbative 
Chern-Simons invariants. We will use the Hamiltonian framework for formal definitions 
and computations that we do from here on. 

16.1 The Hamiltonian approach to TQFT 

In the Hamiltonian approach, one begins with a symplectic manifold called the phase 
space. This is the collection of all positions and momenta. The mechanical system 
is specified by specifying an energy or Hamiltonian function denoted by H on this 
space. The evolution of the mechanical system is given by Hamilton's equations (see 
Arnol'd [13]). A quantization of a classical mechanical system is a map from the space 
of smooth functions on the phase space to the Hermitian operators on a Hilbert space. 
The relation between the Hamiltonian and Lagrangian is given by 



Here |<^) are elements of the Hilbert space, (<^| are the functional obtained from the 
elements via the pairing on the Hilbert space (bras and kets), H is the Hamiltonian, L is 
the Lagrangian and the right hand side is a path integral (that is, the formal integral over 
the space of paths). 

One can reinterpret this last expression by calling \<p\) a state and considering the 
Hilbert space as the space of states. The operator e' tH just represents evolution through 
t units of time. Then one sees that the last displayed expression expresses the evolution 
of the state \<p\) through time as a path integral. 

Geometrically this suggests considering manifolds with two boundary components and 
associating a Hilbert space to each boundary component. The Hilbert space associated 
to one boundary component corresponds to the initial states and the Hilbert space 
associated to the other component corresponds to the states after evolving through time. 
The path integral should give an operator from the Hilbert space associated to one 
boundary component to the Hilbert space associated to the other boundary component. 
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Gluing two manifolds along a common boundary as in Figure 16.1 corresponds to 
taking the composition of the corresponding operators. The Hilbert space associated to 
an empty boundary should just be C; thus operators corresponding to closed manifolds 
can be interpreted as complex numbers. 




Figure 16.1: Gluing for topological quantum field theories 

Formalizing these ideas leads one to the notion of a topological quantum field theory. 
Slices of 3 -manifolds have 2-dimensional boundary components. Here slice means 
cobordism (that is, a 3-manifold with boundary components £±). One views a product 
cobordism [0, 1] x £ as a space-time with two spatial dimensions. Hence such theories 
are called (2+1) -dimensional TQFT's. A (2+1) -dimensional TQFT associates a 
Hilbert space He to any closed surface and a bounded linear map, TLs_ — > He + 
to every cobordism. This formalism was suggested by Segal [139, 141, 140] and 
axiomatized by Atiyah [14]. 

Aside 16.1 Recall that the TQFT approach has origins in the Hamiltonian framework. This 
is a brief outline of the Hamiltonian description of Chern-Simons theory. In Chern-Simons 
theory the Lagrangian is just the Chern-Simons functional and the corresponding Hamiltonian 
is zero. The phase space used for the Hamiltonian description of Chern-Simons theory is the 
space of flat connections modulo gauge equivalence over a surface £ of genus g . Equivalently 
this is the space of all connections modded out by the complexified gauge group (the geometric 
invariant theory picture) or the symplectic quotient of the space of connections by the gauge 
group (the symplectic reduction picture). Denote this moduli space of flat connections by A4s ■ 

In order to construct the associated Hilbert space for Chern-Simons theory, we need to introduce 
a line bundle over the moduli space /As . Quillen's determinant line bundle is the complex line 
bundle over M. g with fiber 

top top 

C A = f\ Cker(40)* <g> f\ (coker^)) , 
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where d& : Qr >°(E,E) — ► Q ' (£, E) is the covariant derivative associated to the flat connection 
A on the bundle E. The associated Hilbert space is then Tiy, := H a (M^,C m+N ). One should 
note that sometimes the quantity k + N is called the level and is sometimes denoted by k. For 
general Lie groups the two different notions of level are related by the so-called dual Coxeter 
number. In this paper we will always use k to be the level as used in the definition of the 
string coupling constant. We discuss this in greater detail in Appendix E. More information 
on the above description of Witten-Chern-Simons theory may be found in Axelrod-Della 
Pietra-Witten [19], Hu [78], Di Francesco-Mathieu-Senechal [45] and Kohno [87]. 

While the motivation for this approach is fairly straightforward, formally constructing 
invariants in this manner is very complicated. There is an alternative approach that is 
more difficult to motivate but slightly easier technically. 

16.2 Link invariants in a U{\) theory 

Instead of cutting 3-manifolds into cobordisms we use the fact that every 3-manifold 
can be expressed as surgery on a framed link to reduce our considerations to framed 
links. Just as every 3-manifold can be cut into cobordisms every framed link can be 
cut into elementary framed tangles. This is easier to draw and conceptualize. 

To pass from a TQFT describing 3-manifold invariants to a corresponding theory for 
framed tangles the notion of a (2+l)-dimensional TQFT was enhanced by Reshetikhin, 
Turaev and many others [129]. One includes Wilson loops (framed links) into the 
manifolds. Formalizing the entire picture with Wilson loops in general is fairly 
complicated because one has to consider framed tangles in general 3-manifolds with 
surfaces of arbitrary genus as boundaries. 

It is easier to formalize this picture for the special case of framed tangles inS 2 x [0,1]. 
These are usually viewed as framed tangles in Mr x [0, 1] as in Figure 16.2. This proves 
to be sufficient because any framed link can be obtained by stacking such tangles and 
any 3-manifold can be obtained by surgery on a framed link. 

Before describing the general case in the next section we are going to introduce the 
cutting and pasting idea in this section in the setting of U(l) Chern-Simons theory. In 
the U(l) theory the cubic term in the Chern-Simons action vanishes because purely 
imaginary 1 -forms anti-commute. Thus one expects considerable simplification in the 
U(l) case. 

According to the discussion on topological quantum field theories with framed tangles, 
one should be able to cut a framed link into elementary pieces and associate invariants 
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to each piece. This is indeed the case. Any framed link can be cut into cups, caps, right 
crossings, left crossings and twists (see Figure 16.2 for an example and Section 16.4 for 
formal definitions). 

The simple U(l) theory at level 2m + 1 produces an invariant of colored, oriented 
framed links. Appropriately drawn link diagrams can be oriented by putting an upward 
pointing arrow on the left branch of each cup. Right and left crossings are then defined 
so that the crossings in the simple diagram for the left Hopf link from Figure 16.2 
are left crossings and the opposite crossings are right crossings. Left and right refer 
to the direction of the strand that goes under the crossing. Coloring is captured by 
labeling every cup with a number p from "Lim+x subject to a compatibility condition. 
The evaluation rule is very simple. In the obvious analogy with Feynman rules, every 




Figure 16.2: The left Hopf link 

/^-labeled cup on a framed link creates a pair ±p with p on the left and — p on the 
right. The rest of the link receives labels by extension once we choose a 'color' p for 
each cup (see Figure 16.2). The compatibility condition is that every cap annihilates 
such a ±p pair. Once the link is colored, the U(l) theory at level 2m + 1 associates the 
following numbers to elementary pieces: 

| = 1 id x p>q = e 27ri M /(2m+i) right crossing 

U = 1 cup x p l q = e -2ni Pq /(2m+i) left crossing 

n = 1 cap B p = £27np 2 /(2 m +i) twist 

The colors p, q can be any natural numbers from to 2m. The final colored invariant 
Jp l: ...,Pi(L) of the framed link is just the product of all numbers assigned to the pieces. 
In analogy to Feynman rules, the quantum invariant F(L) is just the sum of the colored 
Jones polynomials over all possible ways to color the link. 

Example 16.2 Consider the left Hopf link L from Figure 16.2 with the larger component 
colored by p and the smaller one colored by q. Multiplying from the bottom up we 
compute 

i (J\— v -1 .*- 1 ./} .v . f) . y _ -4Tripq/(2m+\) 

Jp,q\J^) — ^p.q ^q,p u 1 A 1, — <l U P *P,~P ~ e 
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Therefore, F(L) = £^=o e - 47Ti ™/ i2m+l) = 2m + 1. 

The 'color and multiply' rule is not very sophisticated. In particular, it implies that 
any two links with the same elementary pieces have the same invariant no matter how 
those pieces are assembled. Of course, this makes it easier to prove independence of the 
presentation of a link by a regular projection but it also yields rather weak invariants. 
We want evaluation rules that are still independent of the projection but 'see' much 
more structure of the link. The right balance for link invariants is struck in the notion 
of ribbon categories. There colors are replaced by simple objects (think irreducible 
representations), numbers by morphisms (think linear maps) and multiplication by 
composition and tensor product. Accordingly, the evaluation rules become more 
involved. 

To get invariants of 3 -manifolds one takes weighted sums of colored invariants such 
as the one computed in the previous example. The hard part is to make sure that the 
final answer does not depend on how the manifold is expressed as a framed link and 
how the framed link is decomposed into elementary tangles. The required axioms are 
formalized in the notion of a strict modular category. In the rest of this subsection we 
introduce ribbon categories first, then discuss modular categories and evaluation rules, 
and finally the quantum invariants that arise from these categories. 

16.3 Ribbon categories 

As explained in the previous subsection the invariants that we are considering are 
naturally defined for framed tangles in R 2 x [0, 1] and satisfy formal gluing axioms. 
We will follow the version of these invariants due to Reshetikhin and Turaev [130] as 
explained by Bakalov and Kirillov [21] and Turaev [152]. 

It is important to keep the basic idea in mind. Every 3 -manifold may be expressed as 
surgery on a framed link. 

Associate a simple invariant to each elementary piece of a framed link 
diagram and define the final invariant to be an algebraic combination of all 
of the elementary pieces. Of course, one must know that different ways to 
assemble the same manifold give the same final invariant. 

Axiomatizing assembly rules with outcomes independent of a link presentation leads to 
the notion of modular categories. 

The basic outline is as follows. Any framed link may be constructed out of elementary 
building blocks that look like Xu,v, Ov, and Uv from Figure 16.3. The trick is 
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to associate an invariant to each of these and then write out all of the axioms that 
correspond to changing the height function isotopy of the link, or Kirby moves. In 
particular, one colors and orients the link. To each generic horizontal line one associates 
a tensor product of objects and their duals according to the sign of the intersection of 
the link with the horizontal line. The elementary building blocks between the horizontal 
lines induce morphisms between the associated objects. We use the standard convention 
that combining elementary building blocks side-by-side corresponds to taking the tensor 
product of the associated morphisms. 

There are many types of categories related to modular categories. It is helpful to consider 
a related category with an easier definition first. The categories related to framed link 
invariants (as opposed to 3-manifold invariants) are ribbon categories. 

Definition 16.3 A strict ribbon category is a category with a unit object ¥ , tensor 
product functor ® , families of isomorphisms x , called the braiding and the twist 
respectively, and a duality triple (*, U, n) satisfying axioms 1 through 12 under (16.5) 
below. 

Remark 16.4 We are slightly changing notation from Turaev [152] and other references. 
The correspondence is = C[/,y, Oy = Oy, Uy = by and Hy = dy. In fact, our 
notation is often used to represent objects in a colored ribbon category. There is a 
natural functor taking the colored ribbon morphisms to morphisms in a strict modular 
category. We feel that no confusion will arise by using the same notation for both, and 
the ribbon notation is more descriptive. 

Remark 16.5 One can obtain a new strict ribbon category by replacing the braiding 
and twist by their inverses. 

The first example of a strict ribbon category is the category of representations of a group. 

Example 16.6 Let REPg be the category of representations of a Lie group. The 
objects are representations p: G —> Aut(V). The unit object is the trivial representation. 
Moiphisms are equivariant linear maps f(gv) = gf(v). The dual representation 
V* has the dual space to V as the representation space and the action is given by 
(g<p)(v) := ip(g~ v). The pairing U is the standard duality pairing for vector spaces and 
n the copairing given by 1 h-> e\ e where {ek} and {e k } are dual bases. The 
image of 1 under the copairing is sometimes called the Casimir element. The tensor 
product is the standard one in the category of finite-dimensional vector spaces with 
action given by tensor product of actions as described in Appendix D. The braiding 
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is given by x^ f (v ® w) = w (8> v, and the twist is given by 6*y = idy. As is, this is 
not a strict ribbon category because equality signs in the axioms are only canonical 
isomorphisms. For example V is not equal to V (8> C. In truth, we should be talking 
about equivalence classes of representations rather than representations themselves. 
This requires some tweaking in the notion of morphisms and ribbon operations that can 
be done in a standard way by the Mac Lane coherence theorem [101]. 

The category of representations cannot be used to construct nontrivial link invariants 
following the procedure given in the next article because the braiding is its own inverse 
(this category cannot detect the difference between right and left crossings.) The 
nontrivial invariants described in the previous subsection can be seen to arise from a 
strict ribbon category. 

Example 16.7 Construct a category W(l)2m+l with objects elements of ^m+i , unit ob- 
ject equal to 0, only the zero morphism between unequal objects and the endomorphisms 
of any object equal to C. The tensor product is given by 

(f: p^q)®(g: r -> s) : = fg: p + r^q + s, 

and the braiding given by 

X M = e 2 ^ /(2m+1) : p + q^q+p. 

The twist is given by 

6 p = e W/C2m+l) . p ^ p 

The duality pairing is n p : p + (— p) — ► and the copairing is U p = 1 : — > p + (—p). 
One sees that this weird category is strict and satisfies all of the axioms for a strict 
ribbon category. 

Exercise 16.8 Prove that in a strict ribbon category = idy = y and the 

following Yang-Baxter equation holds 

( x v, w <8> idf/) ° (idy <8> X u,w) ° ( x u,v ® idw) 

= (idty (8) x j^y) o (x ® idy) o (idy x y>w ) . 

There is a way to represent the axioms and other formulas with ribbon operations 
graphically making them much more intuitive. The correspondence between elementary 
operations and graphs is depicted on Figure 16.3. To draw the picture corresponding to a 
formula start from the right (as in reading Arabic) and draw the pieces corresponding to 
each expression between two compositions on the same horizontal level while moving 
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Figure 16.3: Strict ribbon category 



from the bottom up and putting the obtained pieces on top of each other. Conversely, 
given a labeled graph draw horizontal lines not intersecting any crossings, twists, 
maxima or minima (see Figure 16.2). Next write the expression for each elementary 
piece on the same level and tensor them. Finally assemble the expressions right to left 
by compositions moving from the bottom to the top of the graph. The graphs cannot be 
labeled arbitrarily; the labels on successive horizontal levels must match. If a ribbon 
expression produces a graph with mismatching labels it is nonsensical: in categorical 
language you would be trying to compose morphisms with targets and sources that do 
not match. 

The main advantage of using graphs is that one can immediately see if two expressions 
in a ribbon category are equal. 

If two (correctly composed) expression graphs represent isotopic framed 
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tangles the expressions are in fact equal and the equality can be established 
using the axioms; see Bakalov and Kirillov [21]. 

Framing is important here. Even though we draw pictures with strands one should 
actually think of them as very thin ribbons so that the twists (depicted as curls on the 
strands) cannot be undone. Later we will slightly enhance the notation to allow arrows 
on the strands but for now this will suffice. 

The motivating example of a ribbon category is the category of ribbon tangles. 

Example 16.9 The objects of the category of ribbon tangles are just non-negative 
integers, with zero representing the unit object. The morphisms between n and m are 
just the isotopy classes of framed tangles from n points to m points. Our graphs of 
expressions are just plane projections (with indication of under- and over-crossings and 
twists) of framed tangles with labels being natural numbers. The tensor product is the 
sum and the dual of n is n itself. The braiding, twist and duality for single strands are 
displayed in Figure 16.4; in general one just has to put n and m strands parallel to the 
one or two depicted. Invariants produced by this category are complete but useless: 
the invariant of an isotopy class is the isotopy class itself. Fortunately, there are more 
interesting examples. 



x e n u 

Figure 16.4: Category of ribbon tangles 

The pictures for axioms 6 and 7 are displayed in Figure 16.5. These axioms correspond 
exactly to elementary isotopies. For example, a combination of axioms 6 and 7 implies 
the third Reidemeister move as shown in Figure 16.5. 

Exercise 16.10 Draw diagrams representing each of the braiding, twist and duality 
axioms. Remember that it is standard to compose maps from the bottom up in a diagram. 

16.4 Modular categories 

While looking at axioms 1-12 you may have noticed that the remaining ones 13-17 
have a different flavor. Rather than describing algebraic properties of operations they 
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describe the global structure of a category. The idea is that the first twelve axioms take 
care of the Reidemeister moves and produce link invariants. Surgeries on isotopic links 
produce diffeomorphic 3 -manifolds but diffeomorphic 3 -manifolds are also produced 
by non-isotopic links related by Kirby moves. The invariants we are looking for must 
take the same values on such pairs of links. The Kirby move has a more complex 
structure than the Reidemeister moves and one requires the modular axioms to account 
for it. 

The point is that any ribbon category generates many framed link invariants. 
There is a framed link invariant for each object in the category or more 
generally one can define invariants of framed links colored by objects in 
the category. The hope is that an appropriate linear combination of the 
resulting framed link invariants will be invariant under the Kirby move 
and thus define a 3 -manifold invariant. 

Before stating the axioms of a strict modular category we need to define a few terms used 
in the axioms. Axiom 13 simply says that we can add morphisms with the same sources 
and targets and this addition behaves the same way as linear operators on vector spaces. 
Categories satisfying axiom 13 are called preabelian. Denote End(V) := Mor(V, V) 
and notice that (End(U^), +, o) is a ring. Moreover, it is a commutative ring. 

Exercise 16.11 Use/ o g = (f <g) id^) o (id^ <8> g) to prove that End(F) is commutative 
based on axioms 1-13. Hint: recall that ® is functorial. 
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Exercise 16.12 Prove that Oh = 6^ and 



9v®w — x w,v ° x v,w (yv ® "w) ■ 



In the light of the above we will sometimes omit the composition sign between 
moronisms when composing them. The analogy with vector spaces goes further: one 
can define 'traces' of endomorphisms and 'dimensions' of objects. 

Definition 16.13 The quantum trace off G End(V) is 

Tr q (f) := D v o x v ,v* ° (Ov <8> idy) o (f <g) idy)Uy G Endf^) . 

The corresponding graph is shown in Figure 16.6. The quantum dimension of an object 
is dim 9 (V) := Tr g (idy). 



Exercise 16.14 Prove that Tr q (fg) = Tr q (gf) for all/ and g where the compositions 
make sense, Tr q (f) = f for all / G End^), and Tr q (f (g> g) = Tr q (f)Tr q (g) for 
all morphisms / and g. Conclude that for any pair of objects dim 9 (V <g> W) = 
dim^^dim^W). 

Example 16.15 In the category REP^ of Example 16.6 we have 



so the quantum trace reduces to the ordinary one and dim (y (V) = dim(V). 
Exercise 16.16 In the category U(\)2m+\ of Example 16.7 show that Tv q (f) =f. 
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In preabelian categories one can define an analog of irreducible representations. 

Definition 16.17 To say that an object V is simple means that the map End(U^) — ► 
Mor(V, V) given by / i— ► / ® idy is an isomorphism. To say that such a category 
is dominated by {V\}\ e j means that for every V G Ob(V) there are morphisms 
fr ■ V Xr -> V and g r : V -> V\ r such that id v = ° gr- 

Remark 16.18 Domination is the preabelian analog of semi-simplicity in Abelian 
categories. The point is that while one can take sums of objects in Abelian categories 
one can only take sums of moiphisms in preabelian categories. 

The Schur lemma implies that in the category of representations of a Lie group REP^ 
the simple objects are the irreducible representations. However, any nontrivial Lie group 
has infinitely many simple objects. The category of finite-dimensional representations of 
a semi-simple Lie group is dominated by the simple objects because any representation 
decomposes into irreducible ones. In the category from Example 16.7 every object is 
simple and there are only finitely many objects. Clearly, this category is dominated by 
its simple objects. 

The fact that End(l^) is a commutative ring allows one to define the matrix with the 
following entries (to be used in axiom 17): 

(27) Sa m := Tr^x^A o x AiAt ) 

From Exercise 16.14 we see that s is symmetric. 

Exercise 16.19 Show that the S-matrix defined in (27) is 

°(XA 8fl ® id^A*) ° Ua® m G End(^). 

A graphical representation of the S-matrix is given by the right Hopf link as displayed 
in Figure 16.10. 

We are now ready to define a strict modular category. We refer our readers to Bakalov 
and Kirillov [21] and to Turaev [152] for further description, properties and related 
definitions. The reference [21] uses modular tensor categories as opposed to strict 
modular categories. The difference is that the underlying category in a modular tensor 
category is Abelian rather than just preabelian, that is, direct sums of objects are defined. 
Even though our main example is in fact a modular tensor category, at this point it is 
expedient to ignore the additional structure. 
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Strict modular category ingredients 


Example 


A category V 

A tensor product ® : V x V => V 

A unit IK e Ob(V) 

A braiding x u>v : t/ <g) V -> V <g) £/ 

A twist 9 V : V — > V 

A duality pairing Ry : V* <g> V — > iK 

A copairing Uy : F — > V <g> V* 

A finite collection of simple objects 

{Va}a £ / 


Ob(W(l)2 m +l) = Z 2m+ i 

Mor(p, g) = C if /? = q and otherwise 

if: q)®(g: r^s)=fg: p + r -> q + s 

€ Z 2m+ i 

X M = ^^/am+i) . p + q ^ q+p 

p = e 2^p 2 /(2m+l) . p ^ p 

n p : p + (-/>) -> 
Up = 1 : 0^p + (~p) 
V p =p,I = Z 2m+ i 



Table 16.1: Strict modular category ingredients 



Definition 16.20 A strict modular category is a category with a tensor product functor, 
natural isomorphisms x , 9, a duality (*, U, fl) together with an indexed collection of 
special objects {V\}\ei satisfying (all 17 of) the axioms under Section 16.5 below. 

In particular any strict modular category is a strict ribbon category. The ingredients 
in a strict modular category are listed in Definition 16.4. This table also includes the 
ingredients of the U(l)2m+i category from Example 16.7 as an illustration. 

16.5 Axioms defining a strict modular category 
Tensor axioms 

Axiom 1 v®r=r®v = v. 

Axiom 2 U <8 (V <8 W) = (17 <8 V) <8 W. 
Axiom 3/(8) idj^ = id^ (8/ =/. 
Axiom 4 / <8>(g®h) = (f ®g)®h. 
Braiding axioms 

Axiom 5 x UjV <g,w = (idy <8 x-u,w) ° ( x J/,v (8 id^). 
Axiom 6 Xygy^ = (x UiW ® idy) o (idj/ (8 Xy^). 

Axiom 7 (g<8/)° Xu,w = ~Xv,z (f®g) for anymorphisms/: U^V,g: W^Z. 
Twist axioms 
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Axiom 8 Oy®w = Xw,v x v,w ° (Ov ® Ow)- 

Axiom 9 / o On = V of for any morphism/ : U — > V. 

Duality axioms 

Axiom 10 (idy (g) fly) o (Uy idy) = idy. 
Axiom 11 (Hy <g> idy») o (idy* (g) Uy) = idy. . 
Axiom 12 (Oy <8> idy.) O Uy = (idy (g) 0y») o Uy. 
Modular axioms 

Axiom 13 Mor(V, W) are abelian groups and o : Mor(V, W)xMor(U, V) — > Mor(U, W) 
is bilinear. 

Axiom 14 V is dominated by a finite collection {Va}as/- 
Axiom 15 There is G / such that V =¥. 
Axiom 16 For every AG/ there is a A* G / such that V\* = . 
Axiom 17 The matrix s\ >fl = Tr 9 (x^ o x^ i(U ) is non-singular. 

The tensor axioms are not difficult to understand if one keeps the example of vector 
spaces in mind in which case ¥ is just the underlying field. Graphical representations 
help one understand the braiding, twist and duality axioms. As we emphacized in 
Section 16.3 they simply catalogue elementary transformations of tangle diagrams. In 
fact, one can forget about them and work with diagrams directly. Axioms 7 and 9 just 
restate the naturality of x and but we included them for the sake of diagrammatic 
interpretation. For instance, axiom 9 means that one can slide the twist through any 
morphism. The next example is intended to help illustrate the modular axioms. 

Example 16.21 The category REP^ for an infinite group G fails to be strict modular for 
the trivial reason of having infinitely many simple objects (irreducible representations). 
But even when \G\ < oo this category is not strict modular. It comes very close though: 
the only thing that goes wrong is the non-degeneracy axiom 17. Indeed, for any pair of 
objects (representation spaces) we have by Example 16.15: 

Tr ? (x Wi y o x Vil y) = Tr^idy^iy) = dim^V ®W) = dim(V)dim(W) 

and the structure matrix = dim(V r A)dim(V M ) always has rank 1 . We get non- 
degeneracy for \G\ = 1 that is, the trivial group, but this is a rather trivial example. 

On the other hand, the category U(l)2 m +i from Example 16.7 is both strict modular 
and nontrivial. In fact, every object is simple so it is definitely dominated by simple 
objects, and there are only 2m + 1 < oo of them. By Example 16.15 

S pq = r Y J - tj ( e 2ni Pq/( 2m + 1 ') o g 27r 'P9'/( 2m + 1 )) = e 4nipq/(2m+l) _ ^pq 
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with z := e 4m 7(2m+i)_ 

Exercise 16.22 Verify that the category U(\)2 m +\ from Example 16.7 and the Defini- 
tion 16.4 satisfies the definition of a a strict modular category. Hint: notice that det z pq 
is a Vandermonde determinant. 

We extend the honor of being named a number to the elements of End(JO which is, 
after all, a commutative ring. In all examples of interest to us End(^) = C anyway. 

Definition 16.23 The characteristic numbers of a strict modular category are given 
by d\ := Tr ¥ (id>,), := Tr q (8^ 1 ), where A indexes simple objects. The numbers 

:= Ylx^iPx = ^2xei^f l< ^ x are called the twists and P := (X^a)'^ 2 the quantum 
diameter (also rank or dimension; see Bruguieres [35] and Miiger [1 12]) of a category. 

Remark 16.24 Note that the characteristic 'numbers' are defined in any ribbon category 
not just modular categories. The numbers p^,T) are instrumental in making sure that 
framed link invariants defined by a modular category are invariant under Kirby moves 
and hence define 3-manifold invariants. The square root we need to take to define V 
may not exist in the ring End(IK) and, when it does may not be unique. It is possible to 
extend an arbitrary strict modular category so that this square root does exist and there 
is no important difference between choosing any of the two roots (see Turaev [152]). 
Numerical values of invariants do however depend on a choice of quantum diameter. In 
examples of interest to us End(^) = C and V 1 is a positive real number so we agree 
to always choose the positive square root. For instance, in U(\)2 m +Y there are 2m + 1 
objects of dimension one each so V = (2m + l) 1 / 2 . For REPg with G finite the sum of 
the squares of dimensions of irreducible representations is \G\ by the Burnside theorem 
(see Fulton and Harris [62]) so V = \G\ l l 2 . 

16.6 Coloring, double duals and the arrow convention 

Before introducing invariants of links and 3-manifolds we augment the graphic notation 
introduced in Figure 16.4. You may notice that it is impossible to label a simple 
circle coherently with the rules we have so far because Uy and Hy put together have 
mismatching labels V, V* . We can isotope the circle as in Figure 16.6 (without/) so 
that it can be labeled consistently. Isotoping links into shapes that allow labeling every 
time leads to cumbersome pictures as in Figure 16.2. Instead we can simply add a dual 
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pairing and copairing (cups and caps) to the notation. Namely, define U v : ¥ — > V* (g> V 
and n* v : V V* -> ¥ by (see Figure 16.7): 

Uy = (9y} <g> idy) O Xy V* ° 

Dy = Dv o Xy 5 y. o (9y (g) idy*). 

Using the dual pairing and copairing we can re-express the quantum trace of / : V — > V 

V* V 




Figure 16.7: Dual cups and caps and the arrow convention 
as Ttq(f) = fly o (f ® idy*) o Uy and simplify all the definitions that use it. 

Exercise 16.25 Show that Uy and Pi v in the category U{Y)2m+\ from Example 16.7 
are multiplications by 1 . 




Figure 16.8: Twist from the double dual isomorphism 

Using Uy and n y we can label any diagram consistently with V and V* only. The final 
simplification is to get rid of V* as well by placing arrows on the strands and agreeing 
that an up arrow on a strand in a V-labeled component corresponds to the V label and 
a down arrow on the same component corresponds to the V* label. With the arrow 
convention all we have to do in order to label an entire link or tangle is to orient every 
component and label it with an object in a single place, that is, color it. To evaluate a 
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colored graph one simply has to transform it into the ribbon expression according to the 
rules on Figures 16.7 and 16.4 and tensor and compose all the moiphisms. 

Exercise 16.26 Show that for the category U(V)2m+\ from Example 16.7 this reduces 
to the 'color and multiply' rule of Section 16.2. 




Figure 16.9: Dual pairing via the double dual isomorphism 

One may wonder why double and higher duals in the category have not been discussed. 
What we are doing of course is implicitly identifying V** with V. In any strict modular 
category it is possible to do this explicitly. 

Exercise 16.27 Show that there exists a natural isomorphism V V** in any strict 
modular category such that the twist is given by Figure 16.8 or 

Oy = (idy* tg) fly) o (idy (g> 6y ® idy.) o (Xy,v ® idy*) o (idy (g) Uy). 

The right way to do this exercise is to draw a colored ribbon tangle representing a map 
from V to V** built from elementary pieces. The answer is 

5v = (fly (g idy.) o (Xyy* (g idy..) o (9y (g idy**) o (idy (g Xy**,y*) 

o (idy <g OyL <g idy.) o (idy <g Xy* y..) o (idy <g Uy.) o 6y. 

We call 5 from the exercise the double dual isomorphism. Using this isomorphism we 
can streamline the definitions of the dual pairing and copairing (see Figure 16.9) 

U* v = (idy* (8) Sy 1 ) o Uy* 

n* v = rv o (5 V & idy*). 

The best part about knowing 6y explicitly is that by using it one can evaluate a number of 
graphs without evaluating any braidings in the process. We we will take full advantage 
of this fact when discussing the modular categories coming from quantum groups 
because as in many other nontrivial modular categories, it is the braiding that is the 
hardest to compute. In particular, we get the following braiding-free formula for the 
quantum trace (Figure 16.6): 

Tr q (f) := fly* o (Syf ® idy.) o Uy. 
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Exercise 16.28 Show that the double dual in W(l)2 m +i from Example 16.7 is the 
identity map C — > C and the double dual in REPp is the standard isomorphism between 
a vector space and its double dual 5v(y)(ip) := ip(v). 




Figure 16.10: Characteristic numbers of a strict modular category 



Our conventions thus far only allow evaluation of graphs that are completely colored 
(labeled). But usual framed links do not have representation labels attached to their 
components. One can make sense of unlabeled graphs as well. The trick is to come up 
with evaluation rules such that the resulting expressions do not change at least under the 
Reidemeister moves (for link invariants) and even better, do not change under the Kirby 
moves (for 3 -manifold invariants). There are many different ways to color a particular 
graph and the hope is that an appropriate linear combination of the resulting evaluations 
will be invariant under the appropriate moves. It turns out that there is an essentially 
unique way to evaluate unlabeled graphs or unlabeled closed components in a graph 
that guarantees invariance under the Kirby moves (see Turaev and Wenzl [153]). We 
therefore introduce the following important convention. 

In any diagram we sum over all ways of labeling unlabeled closed compo- 
nents by simple objects A with multiplicity d\ . 

Examples using our extended graphic notation to visualize the definitions of the S -matrix 
and the characteristic numbers of a modular category are shown on Figure 16.10. The 
next article explains in detail how to construct framed link invariants from strict ribbon 
categories and how to construct 3 -manifold invariants from strict modular categories. 

16.7 Invariants from modular categories 

To see the correspondence between the Chern-Simons invariants and strict modular 
categories one needs to describe how to build 3 -manifold invariants from a strict 
modular category. We first define the colored Jones polynomial of an oriented framed 
link L with c(L) components. Color the components with objects V\ , . . . , V c (i) from 
the strict ribbon category. Notice that a diagram with only closed components represents 
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the element of End(H^) obtained by taking the composition of all of the basic morphisms 
corresponding to the elementary framed tangles occurring between horizontal lines as 
in Figure 16.2. The axioms of a strict modular category ensure that this element is 
invariant under all elementary isotopies of the link. 

Definition 16.29 The colored Jones polynomial of an oriented framed link is the 
element Jv lt ...,v c(L) (L) corresponding to the morphism represented by the labeled link 
diagram. 

Remark 16.30 This invariant is not a polynomial. The name comes from the fact 
that the invariant associated to the category of tilting modules is closely related to the 
classical Jones polynomial [81] (which by the way, is not a polynomial either). 

Definition 16.31 Given a framed link (not oriented) L define the invariant 

F(L) = J\i,...,\ m (L)d X] d ML) . 

Just as any isotopy of an ordinary link can be expressed as a composition of elementary 
isotopies (Reidemeister moves I, II and III) any isotopy of ribbon tangles with height 
function can be decomposed into elementary isotopies. According to Turaev [151], 
Freyd and Yetter [59], and Reshetikhin and Turaev [129], the elementary isotopies for 
ribbon tangles with height function are Reidemeister moves II and III (see Figure 13.1) 
together with the moves displayed in Figure 16.11. 




Figure 16.11: Elementary ribbon isotopies 
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Exercise 16.32 Show that the invariant F(L) is well defined when / is any collection 
of objects in a ribbon category, ie that it is invariant under elementary isotopies. 

More generally given a framed link Lm U L where L is colored and oriented define 



Remark 16.33 This is well defined independently of orientations on Lm because 
labeling a component with A is equivalent to changing the orientation on the component 
and labeling it with A*. By axiom 16 the sum is symmetric with respect to taking 
duals. This is also consistent with the sum over all colorings of unlabeled components 
convention. 

Let M be the manifold obtained by surgery on a framed link Lm . Even though F(Lm) is 
invariant under the Reidemeister moves as is, it is not invariant under the Kirby moves. 
Luckily, invariants of non-isotopic links defining the same manifold are related in a very 
simple manner and we can multiply F(Lm) by normalizing factors that cancel out this 
dependence. 

Given a framed link, Lm , one defines a linking matrix ny(L) with off-diagonal entries 
equal to the linking numbers of the corresponding components and with diagonal entries 
equal to the self-linking or writhe of the corresponding component (see Prasolov and 
Sossinsky [125] and Rolfsen [131]). 

Example 16.34 The linking matrix of the framed left Hopf link from Figure 16.2 is 
given on the left below and the linking matrix of the framed link from Figure 13.2 is 
given on the right. 



Let ct{Lm) denote the signature, that is, the number of positive eigenvalues minus the 
number of negative eigenvalues of the linking matrix. With these notations one can 
show that the quantity r(M) defined below is a topological invariant (see Bakalov and 
Kirillov [21] and Turaev [152]). This means that it does not depend on the link used to 
represent M (so any sequence of positive or negative Kirby moves or their inverses may 
be applied to Lm without changing the answer). 

Definition 16.35 The Reshetikhin-Turaev invariant of a 3 -manifold is given by 








F{L M ) ■ 
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It is also possible to define invariants of oriented colored framed links in 3-manifolds. 
A colored framed link in a 3 -manifold can be represented by a link in S 3 with some of 
the components labeled with objects from a strict modular category. Let Lm denote 
the sublink consisting of unlabeled components. We assume that Lm is a surgery 
presentation of M. Each of the labeled components represent a component of the 
framed link L'mM. The labels correspond to the representations labeling Wilson loops 
in the heuristic description. 

The more general invariant is defined by 

t(M,L) := (p-) a(LM) V~ a(LM) - c - l F(L M ,L). 

We define, 

(28) Z(M):=t{M,Q)), and W Rl ,..., Rc (L) := t{S 3 ,L)/t(S 3 ). 

to be the mathematical interpretations of the physically motivated invariants discussed 
earlier. 

It may appear that we have a reasonably simple definition of partition functions in 
(16.35); however it depends on the choice of a strict modular category and we do 
not have any other than Li(l)2, n +i yet. The invariants corresponding to U(V)2m+\ are 
heuristically the same as the invariants 'defined' via the path integral by integrating 
holonomies over a space of U(l) connections. Since U(l) is Abelian, the cubic terms 
in the Chern-Simons invariant vanish so the resulting theory is what physicists call a 
Gaussian theory. This is the case where the path integral would be easiest to formalize 
mathematically but it leads to fairly weak invariants. In the next subsection we define 
quantum groups because their representations lead to more complicated strict modular 
categories that in turn lead to more interesting quantum invariants. 

We now compute the invariant of S 3 in three different ways. Using the empty link 
we see that r(5 ) = V^ 1 . The following examples compute the same invariant from 
different link presentations. Note that there is some algebra involved in establishing 
that the obtained values are the same. 

Example 16.36 Let L be the link (twisted unknot) that defines p~ in Figure 16.10. 
We compute F(L) and t(Ml) in the category U{\)2m+\ ■ Coloring the single component 
by p and orienting the component clockwise we have 

J P (L) = ry o (0-1 ® id,,,) o u p = o p l = e- 2 ^ 2 /(2 m +D ) 
where p takes values 0, 1, ... , 2m. Since d p = 1 for all p, 

F ^ = E 2 p m =0 d pW = E 2 p m =0 e ~ 2nip2/{2m+l) - 
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Since L only has one component the linking 'matrix' is just the self-linking number 
which is — 1 because of the twist. The signature is also — 1 . The quantum diameter of 
U(V)2m+\ is I? = (2m + l) 1 / 2 (see Remark 16.24). By the very choice of L we have 
p~ =F(L). Thus 

T (M L ) = (p-y l V l - l - { F{L) = V~ l = (2m + ir 1/2 . 

Example 16.37 A slightly more difficult example is the left Hopf link from Figure 16.2. 
In fact, we already computed F(L) = £^ =0 e- 4m ™/ (2m+1) = 2m + 1 back in 
Example 16.2. Self-linking numbers in this case are both (no twists) and the linking 
numbers between the components are —1 (orientation!). Hence the linking matrix is 

■«-(- . "o') 

and a(L) = 1 — 1=0. Assembling the results we get 

T (M L ) = (p~)°V- 3 F(L) = (2m + l)" 3 / 2 e -^i Pq /(2m+i) = (2m + l)- 1 / 2 . 

p,q=0 

Exercise 16.38 Use the Reidemeister and Kirby moves to show that all three links 
employed above represent the same 3-manifold, namely the 3-sphere. 

To fully describe how the notion of a strict modular category relates to the Witten's 
Chern-Simons invariants one should describe the associated topological quantum field 
theory. The general outline is as follows. Diagrams with only closed components may 
be cut into parts with marked surface boundaries. A Hilbert space will then be associated 
to each such marked surface and a bounded linear map will be associated to a cobordism. 
The Hilbert space associated to a genus g surface marked with colors (simple elements 
of the category) V u . . . , V n is H = Hom(¥, (0. £/ Ay (8) A/)®* (g> Vi <g> • • • <g> V„) . There 
are many technical details that must be addressed in order to do this properly. A very 
careful explanation is given by Turaev [152]. 

Exercise 16.39 Compute the U(l)2m+\ invariant of the 3-manifold represented by 
the framed link from Figure 13.2. 

Exercise 16.40 Describe how to compute the W invariant of any framed link for this 
strict modular category. (There is a fairly simple formula in terms of the linking matrix 
and colors.) 
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17 Quantum groups and their representations 

The ingredients in a strict modular category look like the representations of some 
algebraic object. This is indeed one of the best methods to construct strict modular 
categories. In this subsection we consider the representations of deformations of Lie 
algebras called quantum groups. Deformations are constructed to arrive in a non- 
commutative setting. Without such a deformation the morphism that would be attached 
to a right crossing would be the same as the one attached to a left crossing. The invariants 
constructed from the resulting representations are the original Reshetikhin-Turaev 
invariants. 

17.1 Quantum groups at roots of unity 

The axioms of a strict modular category are very complicated, so without some additional 
motivation it would be difficult to construct an interesting modular category. Luckily 
the study of symmetry in quantum mechanics led to very similar structures and it was 
possible to construct interesting modular categories from quantum groups. 

The category that produces the usual quantum or Reshetikhin-Turaev invariants is 
a category of representations of certain quantum groups. The latter are algebraic 
objects that replace classical groups in description of symmetries on quantum (or 
noncommutative) spaces. The term 'quantum group' is used rather loosely and is 
usually reserved for deformations of algebras associated to classical groups. In particular, 
they are not groups; instead they generalize classical group algebras and enveloping 
algebras rather than groups. 

We begin this section with a brief description of the ideas that led to the discovery of 
quantum groups. As one would guess from the name the original motivation for the 
definition of a quantum group comes from quantum mechanics. However there is also a 
strong analogy between quantum groups and algebraic groups and much of the theory 
of quantum groups was first developed for algebraic groups. To quantize a mechanical 
system via canonical quantization one tries to find an embedding of a deformation of 
the algebra of observables into the linear operators on a Hilbert space (see Simms and 
Woodhouse [143]). The failure of operators corresponding to various observables to 
commute can be interpreted as the uncertainty principle. Often the Hilbert space can be 
taken to be a space of functions on the configuration space. 

Example 17.1 For a free particle moving in 1 -dimensional space the observables are 
functions of position x and momentum p . A possible Hilbert space to associate to this 
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ih^- to the observable 
ih. Thus L x and L p do not commute. However in 



system is the space of L 2 functions in the variable x. One then associates multiplication 
by x, denoted by L x , to the observable x and the operator L p 
p. Notice that one has [L x ,L p ] 
the classical limit h —> one recovers the classical algebra of observables. In most 
everyday situations this is a reasonable approximation because H = 1.055 x 10~ 34 
joule-sec. 

For the case of maximal symmetry the configuration space is a group. When the 
underlying configuration space is a group the space of functions on the group can be 
given the structure of a Hopf algebra. Thus Hopf algebras appear naturally in quantum 
mechanics as algebras of functions on groups. 

Recall that a group can be described as a tuple (G, /u: GxG — > G,e: ¥ — > G,n: G — ► 
G) such that the following diagrams commute. 

id, (J, _ _ „ id,e _ „ _ _ id,n 



GxGxG 



;U,id 



GxG 



GxG 



Gx¥ 



GxG 



GxG 



GxG 



¥ 

G 



H id eop 

Here 5: G — > G x G is the diagonal map. If A is an algebra of functions on a 
finite group G, the multiplication \x identity e and inverse n on G will induce a 
comultiplication A : A — > A A , counit e : A — > C and antipode 7 : A — > A on A 
such that the following diagrams commute. 

id® A , _ , , _ ^, id®£ 



- A(gA A<g)C ■ 



■ A<gA A®A -= 



id® 7 



-A®A 



A®id 



A<g)A 



A id 

The comultiplication A and counit e are algebra homomorphisms and the antipode is 
an anti-homomorphism (j(ab) = 7(6)7(0).) This is essentially the definition of a Hopf 
algebra. 

Definition 17.2 A Hopf algebra A over a field ¥ is an associative algebra with 
additional algebra homomorphisms (counit, coproduct) e : A— > F, A : A® A and 
an algebra antihomomorphism (antipode) 7 : A — > .A satisfying axioms that dualize 
the usual axioms for the unit, product and inverse in a group (see the examples below, 
Chad and Pressley [40] or Majid [102] for the complete list). 
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For quantization one needs to deform the algebra of functions into a non-commutative 
algebra. Now consider the case of the group SL2C. The natural action of this 
group on C 2 will help motivate the correct deformation. The algebra of functions on 
C is just the ring of polynomials C[x, y] and the algebra of functions on SL2C is 
C[a, b, c, d]/(ad — be — 1). Recall that a map between spaces induces a map on the 
associated function algebras going in the opposite direction. Now the natural action 
of SL2C on C 2 by matrix multiplication induces the corresponding map of function 
algebras 

C[x,y] -» (C[a,b,c,d]/(ad-bc- 1))«C[ij]. 

To arrive at a non-commutative deformation of the function algebra of SL2C (that is, 
C[a, b, c, d\/{ad—bc—\)), start with the non-commutative complex plane. The function 
algebra of the non-commutative plane is C{;t,y}pz]]/(.ry — e~ h yx). Here C{.x,y} is the 
free algebra on two generators over C and R\h\ refers to formal power series in the 
variable h with coefficients in R . In order to keep track of all of the important algebraic 
structure one must consider the Hopf algebra structure on the resulting algebra. 

There is essentially a unique Hopf algebra that respects the map of function algebras 
induced by the action of SL2C on C 2 with the algebra of C 2 replaced by the non- 
commutative version and the determinant replaced by ad — e~ h bc. This algebra is 
denoted by SL^C. Such algebras or their duals are called quantum groups (sic!). The 
quantum group U q {shC) with q = e h is 'dual' to the deformed function algebra SL^C. 
See Chad and Pressley [40, Chapter 7] for more details. 

Because the axioms for operations and co-operations in a Hopf algebra are symmetric 
one can define a dual Hopf algebra A* (in a couple of ways) switching them, that 
is, the product in the dual comes from the coproduct in the original, etc. We will 
be mostly interested in the duals of the deformed group algebras such as 5L|C. For 
simply connected Lie groups these duals or distribution algebras can be described 
as deformations of the universal enveloping algebras U(q) of the corresponding Lie 
algebras g. Our main example comes from deforming £/(s[#C). 

Many Lie algebras are matrix algebras with bracket given by [X, Y] = XY — YX. Any 
Lie algebra g can be embedded in a unital associative algebra so that the Lie bracket 
takes this form. Recall that the universal enveloping algebra t/(g) is the quotient 
of the tensor algebra © n >og®" by the ideal generated by x (g) y — y (g) x — [x, y\ with 
G g. In a form more germane to quantum generalizations U(q) can be described by 
generators and relations. Namely, if jci , . . . , x n generate g with relations given in terms 
of brackets then U (g) has a presentation with the same set of generators with relations 
obtained by replacing all brackets with the corresponding commutators. For example 
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[x, y] is replaced by xy — yx. We will usually write brackets even when working in t/(g) 
interpreting them as commutators. 

Before discussing the deformations recall the presentation of the enveloping algebra. 
Exercise 17.3 Compute [e,f], [L,e] and [L,f] for the following matrices. 

o o) ' /= (i o) ' L= (o -1 

Exercise 17.4 Express [X, [X, Y]] as a product of X and Y factors. Let Ejj be the 
square matrix with a 1 in the i,j entry and zeros elsewhere. Compute [Ei t i+i,Ejj+t], 
[Ei t i+i,Ej + ij] and [E iji+ i,Ejj — Ej+ij+i]. 

Example 17.5 Recall from Fulton and Harris [62], Humphreys [79] or Appendix D 
that sInC is generated by a- and i = I, ... ,N — I. Here af represent the simple 
coroots of sInC and are the corresponding positive and negative root vectors. In 
other words e, = £i,;+i , /; = ^z+i,; and a/ = Ejj — Ei + \j + \ . The relations in sl^C 
and £/(s[atC) are: 

[a/, aj] = VeiJj] = <%a/, 

(29) [a] / ,e j ]=a ij ej \e u ej] = ]fi,Jj] = 0, 

\- a ) ' Jj\ = - a ijfj i e h \?u «]]] = \fu \fufj]] = 0, 7 = i ± 1, 



where 



dij = Tr(a^ a,- ) 



o |f-y|>i 

-1 \i-j\ = l 
2 i=j 



is the Cartan matrix of s^C. The last pair of relations are known as the Serre relations. 
The Serre relations look more familiar if written in the associative form 

(30) ejej - 2e i e j e i + ejej = jffj - 2/0 ■ 0. j / : 1 . 

£/(s[atC) also has a Hopf algebra structure given by 

e(x) = 0, 7(x) = -x, A(x) = 1 <g) x + x <g> 1 . 

For s^C there is only one generator of each type e,f, ot y , the Cartan matrix is 1 x 1 , 
that is, the number 2 and the Serre relations trivialize so (29) reduces to 



[a v ,e]=2e, [a v J] = -2f [e,f] = a 



v 



Qeometry & Topology Monographs 8 (2006) 



Introduction to the Gopakumar-Vafa Large N Duality 



343 



Following the motivation above we now explain the quantum deformation t/^s^C). 
According to V Drinfeld [51] the first two relations in (29) should stay intact whereas 
the last one should become 

sinh(fta v ) 

[e,f\ = ■ , rk , , 
sinh(Ai) 

where h is a formal parameter (the Planck constant). To make sense of this we would 
have to consider formal power series in h with coefficients in {/(sfeC) which is not 
very convenient. Fortunately, there is a bypass due to M Jimbo who suggested setting 
q = e h and introduced two new generators q ±a so that the last relation becomes 

q-q 

Now we only have to extend the field of coefficients from C to C(q) (rational functions 
in q). Of course we now need to eliminate a v form the first two relations above which 
leads to 

q a eq a = q z e, q a fq a =q J. 

Note that q ±aV are indeed new generators and by no means a variable q 'taken to 
the power' ±a v (some authors denote them K ±l to prevent confusion; see Chari and 
Pressley [40] and Majid [102]). It is interesting that the representations of t/fs^C) 
and £/(sl2C)[[/i]] are in bijective correspondence [40]. Moreover, any deformation of 
[/(sfeC) and more generally U(q) as an associative algebra is trivial, that is it produces 
an isomorphic algebra [40]. It is in the Hopf algebra structure that the difference 
between U and U q becomes essential. For {/^(sfeC) the deformed co-operations are 
given on the generators by 

-q ±aV ®q ±a \ 
:/<g> \+q- aV ®/. 



e(q ±aV ) 


= 1, 


7(«? ±aV ) 


= q^\ 




8(e) 


= 0, 


7(e) 




A(e) 


e(f) 


= 0, 


7(f) 


= -^7, 


A(f) 



As Hopf algebras {/(sfeC) and U q (sli l C.) are not isomorphic. 

One would expect that L^fs^C) is a deformation of {/(sfeC) but this is not quite true. 
What is true (see Kassel [83]) is 

U(sl 2 C) = U q=1 ( 5 l 2 Q/(q aV - 1). 

We now give the general definition for L^sl^C) following Chari and Pressley [40]. 

Definition 17.6 The (rational form of the) Drinfeld- Jimbo quantum group £/ £y (sl/vC) 
is the Hopf algebra generated as an associative algebra over C(q) by the generators 
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q ' ,ei,fi with i = \, . . . ,N — \ satisfying the relations 

(31) q a ^e jq - a ^=q a "ej, {e u ej\=\f u fj\=0, 
q aVi fjq~ aW, =q~ aiJ fj, efej-ig+q-^iejet+ejef^, /=»±1, 

fifj-{q+q- 1 )Mfi+0=O, M ± 1 • 

Here a,y = Tr(a^aJ') is the Cartan matrix of styC. On the generators the counit, the 
antipode and the coproduct are given by 

e(q ±aV -) = 1, 7 (<7 ±aV ') = ^ V <, A(<7 ±qV ') = q ±aV - ® 4 ±a \ 

(32) £(«*) = 0, 7(e/) = -«i?" aV S A Oi) = e« ® 9° V ' + 1 ® e,-, 
£(/«•) = 0, Wd = -q aV % &(fd=fi®l + q- aVi ®fi- 



Remark 17.7 Notice that for sIa?C the coroots and the roots can be identified. The 
reason for using the notation that we used here is that it generalizes automatically to a 
quantum group constructed from any semisimple Lie algebra. 



The comultiplication is used to construct the tensor product in the strict modular 
category constructed from representations of quantum groups, the counit leads to the 
unit, the antipode leads to the duality and the square of the antipode leads to the twist. 
Constructing the braiding is a bit tricky. We address these issues later in this section. 

One other tricky point is making sure that there are only a finite number of simple 
objects. This is accomplished by specializing to a root of unity; however, this is not 
as easy as one might guess. It is tempting to consider UqislffC) over C rather than 
C(q) by specializing q to a particular complex number z- When z is not a root of 
unity one obtains a Hopf algebra with generators and relations given by (31) and (32), 
and q replaced by z (not in q ±ai since those are names of generators). As associative 
algebras U z (sInC) and U(sInC) are isomorphic and thus have identical representation 
theories. As nice as this might be it means an infinite number of simple objects 
(irreducible representations) and no hope of modularity. To understand why the case 
Z = e, e = 1 with / an integer is exceptional we need to introduce the notions of 
g-integers, ^-factorials and g-binomials. 

Definition 17.8 For any n£Z define the ^-integers (quantum integers) as 

q" - q~ n „ 

[n] q := V G C (9) 

q-q' 1 
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and for n > define the ^-factorials as 

[n] q \ := [n] q [n-l] q ...[2] q [\] q . 
For any pair of integers < m < n define the ^-binomial coefficient as 

[n] q \ 
[n - m] q ![m] q l' 

Exercise 17.9 Verify that [n] q = q m [n — m] q + q~^ n ~ m '[m\ q and use it to derive the 
Pascal recurrence relation for g-binomials: 



" n 




~n - r 


+ q 


—(n—m) 


' n- r 
m — 1 


.m. 


q 


m 







Obviously, in the limit q — > 1 the ^-numbers turn into the ordinary integers, factorials 
and binomials. Many formulas in representation theory involve multiplication by 
n\ . Under quantum deformation these become multiplications by [n] q \ . However, if 
e l = 1 and we specialize to q = e then [/'],,! = for e 1 ' = ±1 and some irreducible 
representations will become reducible. For e = ±1 the defining relations in (31) do not 
even make sense as written due to division bye — e~ ! =0. But even for / > 2 simply 
setting q = e 27 "/' will not produce the 'right' version of the quantum group. The right 
version was introduced by G Lusztig [100]. 

Lusztig notes that [n] q = Yl'l=o c l n ~ l ~ 2k e ^[<7><Z -1 ]> that is, it is a Laurent polynomial 
in q, and therefore so is [n] q \. Less obviously, the ^-binomial [" n ] q is also a Laurent 
polynomial. 

Exercise 17.10 Prove the last claim. 

Hint: Use the Pascal recurrence and proceed by induction on n and m. 

The Laurent polynomials play the same role in C(g) as the integers play in C . More 
importantly, for any 7r G l*[q,q~ l ] the value tt(z) is well-defined for any z 7^ 0, in 
particular, q can be specialized even to roots of unity in Laurent polynomials. This 
suggests defining an integral form of U q (g) before specializing to roots of unity. For 
the classical enveloping algebras such a form is known as the Kostant Z-form (see 
Humphreys [79]) and it uses the divided powers -V, i as generators. This motivates 
the following definition (see Lusztig [100]): 

Definition 17.11 The divided powers in U^sInC) are defined by 

n rn 
(n) ._ gj Art) . = Ji 

1 ' [n] q V Jl ' [n\ q V 
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The trick now is to rewrite the relations (31) in terms of the divided powers and make 
sure that their coefficients are Laurent polynomials. This is indeed the case and we give 
some of the relations below (the full list occupies an entire page in Chari and Pressley 
[40]). 



q a 'ef ] q- a ' 
-CtiAn)—cti 



(34) 



nay An) 



q f} q 

e [n)j(m) _y( m ) e («) 



e (m) g (n-m) 



(hi) An—m) 



fifi 



fi 



in) 



e (m) e (n-m) _ ' jtn—m—j) 



q a J; 2j-n 



In the last formula we used a new notation 



(35) 



q a '\ c 



if 



q c+l-k q ai 



q -q 



-(c+l-k)q-ai 
k 



with c£Z and j G Z>o- These new elements are not generators, they can be expressed 
via e)r,ff with Z[q, q~ x ] coefficients (see Lusztig [100]). We mention them because 
they will play an important role in the representation theory later. Once we have the new 
relations we can forget about the origin of the divided powers and treat them as formal 
symbols that satisfy the relations (34). Indeed, we have to do this since Definition 17.1 1 
makes no sense for q = e a root of unity. 



Definition 17.12 (Quantum groups at roots of unity) The restricted integral form 
^ilq 9 -i]( s ^C) of the quantum group U q (slffC) is the Hopf algebra generated over 
Zh?,^" 1 ] by # ±a '' , ey \jf . The corresponding quantum group at a root of unity 
L^ 6S (51atC) is obtained by specializing g to e and changing the coefficients to C. 
Formally, 

£/f s (sI*C) := U™ q ^p[ N C) ® zlM -i] C , 
where Z[g, q^ 1 ] acts on C in the obvious way with 7r 7r(e). 

Remark 17.13 The algebra t^fj? ^(sltfC) has a presentation with generators q ±ai , 
e\ ,Ji and relations including (31) and co-operations induced from (32). 

At this point the whole digression on divided powers may seem superfluous: why not 
simply take the subalgebra of f/ 9 (sljvC) generated over Z[q, q~ l ] by q ±a < , and 
then specialize to e? Intuitively, the difference is due to the following. In Uqisl^C) we 
have the equality e" = [n] ? !e/ . Since this equality only contains a Laurent polynomial 
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it continues to hold in U 1 ^ ^-ijCsIjvC) and therefore in L^ es (sl;vC). However, if e = 1 
then [n] e \ = for n > 21 and the higher powers of et,fi vanish while the divided 
powers survive! Otherwise there is no way to define divided powers and the quantum 
group simply looses part of the structure. On a bright side, for < n< //2 we have 
[n] e \ ^ and the quantity e\ = e"/[n] e \ is well-defined even in L^ es (st/vC). This 
simplifies computations with the divided powers in this range and allows one to use 
simpler relations (31) instead of (34). 

17.2 Representations of U r € e s (sl N C) and tilting modules 

The category of all finite-dimensional representations of L^ es (5(^C) is still too large 
to be modular. What we need is the category Tilt e of the 'reduced tilting modules'. 
This is a suitable subquotient of the category of representations of U v ^{sInC). This 
means that we will consider only some of the representations (the tilting modules) and 
construct Tilt e by quotients of these by 'negligible' parts (reduced). In this subsection 
we define the subcategory of tilting modules. Here the word module simply means 
representation space of L^ es (s[;vC). We define the category of reduced tilting modules 
in the next subsection and defer the modular structure until even later. 

Much of the early work on these representations was done by algebraists (see Lusztig 
[100] and Andersen [9]) interested in the representation theory of algebraic groups in 
positive characteristic. They introduced the terminology which became standard. We 
adopt it here despite the fact that it is not customary in the representation theory of Lie 
groups and algebras that are the closest classical analogs. Since prime roots are the 
most interesting for algebraic groups the theory was originally developed for odd roots 
of unity. This makes the algebra somewhat easier (see [9] and Chad and Pressley [40]). 
On the other hand, for topological applications one has to consider the even roots of 
unity. In particular, the correspondence with the SUn Chern-Simons theory at level k 
requires the order of the root to be / = 2(k + N). More seriously, the category Tilt e for 
odd roots of unity is not modular, the non-degeneracy Axiom 17 fails(see Sawin [136]). 
This problem can actually be fixed by 'modularization' but this was realized much later 
and involves additional technicalities (see Bruguieres [35]). 

To keep track of the difference between the even and odd cases we define 

, 1 1 I even 
i :={ 

[1/2 /odd 

Said in a different way I' is the smallest positive integer such that e l = ±1 where e is a 
primitive Zth root of unity. 
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For a while in the 1990s there existed a well-developed representation theory for 
odd roots of unity that did not lead to a non-degenerate S-matrix. There was also a 
non-degeneracy proof for even roots (see Turaev and Wenzl [153]) but no corresponding 
representation theory. Thus in papers and monographs written in the 1990s authors 
either did not treat 3 -manifold invariants at all [40] or implicitly assumed that the 
representation theory transfers from the odd case to the even (see the work of Bakalov, 
Kirillov, Reshetikhin and Turaev [21, 84, 152]). There is still no single source where 
all the required algebraic facts are stated and proved in the correct generality. We will 
state results in a form that works for both cases but a reader interested in connections to 
Chern-Simons theory may safely assume everywhere that / = 2(k + N). 

Representations of the classical enveloping algebra U(g) are of course the 'same' as 
those of g meaning that every representation of the latter extends to one of the former. 
The representation theory of U q (g) after specializing to q = e includes some subtleties. 
To stimulate intuition we begin by recalling how a classical irreducible representation 
of sfeC with the highest weight A G A+ is constructed. Looking at Appendix D first 
for notation and basic results from classical representation theory may help. 

Example 17.14 Let q = s^C and A = u> = ha, where a = E\\ — E22 is the (unique) 
simple root of s^C and u> is the corresponding fundamental weight (here and below 
we identify the Cartan subalgebra f) with its dual h* via the Killing form). Recall from 
Example 17.5 that the generators and relations for sfeC are e,f,a: 

(36) [a, e] = 2e, [a,f] = -2f, [e,f] = a. 

Let uo be the highest weight vector. By definition 

euo = 

auo = (u>, a)uo = \{a, o)uq = u$. 
Now set u\ \=fuQ and compute using (36) 

eu\ = efiiQ = ([e,f] +fe)uo = auo + = uo 

au\ = afuo = {[a J] + fa)u = —2fu Q + fu = -u\. 

An analogous computation shows that setting 112 := fu\ leads to eii2 = and 
au2 = — 3«2- It follows that we could set fu\ = and obtain a well-defined 
representation. Furthermore the representation must be simple because the orbit of any 
non-zero vector under sfeC generates the whole space. Thus, the representation space 
V w is spanned by uq, u\ . In the uq, u\ basis we have 

e= (°o 0)' a= G -1)' f= {i 0)' 
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which one easily recognizes as the defining representation of sfeC. 

In general, given generators e,-,^-,a; of sl^C and a dominant weight A G A+ with 
highest weight vector uq one has eiuo = 0, o/ M o = (A,a,)«o- This is exactly the 
classical Verma module V\ where «o is the equivalence class of 1 . We keep generating 
new vectors Ui lr __ ; i m := fi m . . .f^uo in lexicographic order and keep computing the 
action of the generators on the new vectors using the commutation relations. When we 
find the largest proper set of vectors generated by these and the sl#C action it will be a 
maximal proper ideal of the Verma module and the quotient will be the maximal abelian 
quotient. Said differently, we set the action of/] to on the last vectors produced and 
obtain an irreducible representation V\(sInC). More formally, we take the quotient of 
the space spanned by all (infinitely many) uo, u^..,^ by the maximal invariant subspace 
under the action of e\J^ a, •. In Example 17. 14 the maximal proper invariant subspace 
is spanned by U2, m, . . . and this is why taking the quotient reduced to setting fu\ = 0. 

Exercise 17.15 Show that the basis of V^s^C) formed as above is uq, u\,u\2 by 
computing the action of ei,fi, a; as described above. 

Now we wish to apply the same approach to U q (shC). 

Example 17.16 The generators now are q ±a ^ , e,f and the relations (36) get replaced 
by 

(37) q eq = q e, q fq = q f, [e,f] = 

q-q 1 

As in Example 17.14 we have 

q aV uo = q^ ,OL • > «o = quo 
euo = 0, 
fu Q =: m 

q u\ = q fq q uo = q J ■ quo = quo, 

q aV -q~ aV 

eu\ = efuo = ([e,f] +fe)u Q = z^«o + = u . 

q-q 

V -i 

An analogous computation for 112 := fu\ shows that q a 112 = q~ U2 and eu2 = 0. 
Hence we should set/wi = then mq, u\ form a basis of V2(sl2Q- 

It is customary for sfeC to use the basis of the divided powers v; := j^Ui = \j l uo as a 
canonical one. One can show along the above lines (see Kassel [83]) that in this basis 
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the action of sfeC and {/(sfeC) in the representation V^-i^s^C) is given by: 

a v v, = (m- 2i)Vi, 
evi = (m — i + l)v,-_i, i = 0, 1, . . . ,m — 1 
>i = 0'+ l)v,- + i. 

This generalizes straightforwardly to the quantum case, where ordinary numbers 
are replaced by ^-numbers from Definition 17.8. The corresponding representation 
V ( ^_ 1)a ,(s[ 2 C) of UgQshQ is given by Chari and Pressley [40] 

(38) evi = [m- i + l] ? v,_i, i = 0, 1, . . . ,m - 1 

fvi = [i + l] 9 v i+ i- 

Representations of interest to us are constructed from the so-called Weyl modules that 
are the e analogs of the V* representations from Example 17.16. Their construction is 
largely parallel to the construction of the corresponding classical representations but 
with important caveats. 

Definition 17.17 Let A r be the root lattice of sl^C i.e. the lattice generated by all 
a\ = Efi — E* +u+l and let <ft '■ A r — > Z 2 be a homomorphism. A [/^(slArQ-weightis a 
pair (A, <p) with A an ordinary sljvC weight and <p a homomorphism. A weight vector 
in a UqislfjC) representation is a vector v^\,4>) sucn th at v (\<t>) = <f>((Xi)q^ a ' )v (A,0)- 
A type I representation is one with <fi being the trivial homomorphism. 

Remark 17.18 Only the type I representations of Uqisl^C) have classical analogs. 
From here forward we only consider type I representations. 

Definition 17.19 (Weyl modules) Let A be a dominant weight and let Z\ be the left 
ideal of U q (s^C) generated by e; and q ±a > — q ±X( - a i \ The Verma module is the 
quotient 

V q x (st N C) := U q (sl N C)/I x . 

Denote by V|(sljvC) the quotient of V|(s[jyC) by the maximal invariant subspace with 
the induced action. The restricted integral form of this representation V^' r<?s (s^C) is the 
U% - t] (sl N C) submodule of Vl(sl N C) generated by 1. The Weyl module W{(sl N C)\s 
the vector space over C generated from V|' res (slArC) by changing coefficients from 
Ij[q,q~ l ] to C (7r i — ► vr(e)) with the action of [/^(sljvC) obtained by specializing q to 
e. Formally, 

Wl(sl N C) := V|' res (st w C) ® Z[M -i] C. 
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This definition seems a bit convoluted but it is in essence parallel to the definition of 
t/^ es (slArC) itself. We are trying to avoid the trivialization of the powers of et,fi by 
making sure that the divided powers are present as 'independent' quantities. The space 
V?(s[jvC) is just the vector space over C(q) generated by uq, ui u ,,,j m for it = 1, . . . ,N— 1 
with the action of £7^(5 IatC) determined by 

em = 0, q ±a 'u = q ±(X ' a) u , fm u ...j m = u iu ...j imi . 
To get to the Weyl modules we restrict to an integral form then specialize coeficients to 



Example 17.20 Recall from Example 17.16 that V^jv/sfeC) is spanned by vo, . . . , 
v„,_i . Iterating the action (38) we obtain 

[m — i + ri]q\ 

e n Vi = [m - i + \\ q . . . [m - i + n\ q v,_„ = — — — v,-_„. 

[m-i] q \ 

and analogously for/". Using Definition 17.11 of the divided powers and (33) of 
^-binomials we get 

m — i + n 
n 

i + n 
n 



(39) 

fV> Vi 



Vi-n, 

1 



v i+n- 

1 



Since ^-binomials are in Ij[q,q~ l ] we see that vo, . . • , v m -i also form a basis of 
^(m-i>j( s W W ^ tn ^ e act i° n given by (39). Thus by Definition 17.19 the Weyl module 
WL_ ^(sfeC) is spanned by the same vectors and the action of [/^(s^C) on them is 
given by replacing q with e in (39). It can be shown (see Chari and Pressley [40]) that 
WL jv^CsfeC) has an invariant subspace Wi nv unless m < I' or m = /' — 1 mod 
see Exercise 17.27. This means that for other values of m not only is WL 1)a ,(s[2C) 
not irreducible but it is not even a direct sum of irreducibles. Thus, complete reducibility 
of the classical representations that still holds for t/ 9 (s[jv<C) is lost for L^ es (slArC). 

This example demonstrates an important method of doing computations in L^ es (5l^C) 
that avoids using its complicated relations directly (and this is the reason we did not give 
a complete list of them in (34)). This idea will be used again and again in the sequel. 

To obtain an equality in t/^ es (st/v(C) perform all the computations in 
U q (3lffC) using (31) and rewrite the end result in terms of the divided 
powers so that it only contains Laurent polynomials as coefficients. Then 
specializing q to e gives an equality in Uf s (slNC). 
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Before dealing with the loss of complete reducibility we have to address a more basic 
problem with the definition of a weight space. Recall from Appendix D that classically 
a vector v has a weight A G A w if a,-v = (A,a,-)v for all simple roots a,-. For g 
an indeterminate this generalizes straightforwardly to the quantum case by setting 
q±oa v — q±(X,ot{) v instead. This also works when we specialize q to a generic complex 
number e. If however e is a root of unity and j3 G A r we have e ( A +'ft a <) = e ( A > a <) an( j 
A would only be defined modulo lA r . 

The underlying reason for the weight ambiguity is that for roots of unity the maximal 
Abelian subalgebra of U x ^ is no longer generated by q ±ai [40]. The additional 

q y J . Note that substituting e into 
'definition' (35) leads to a meaningless expression. To make sense of these elements in 
L^ es one has to reexpress them in terms of the divided powers. Below we incorporate 
these new elements into the definition of a weight space so that the weight is now 
well-defined. 

Definition 17.21 Let V be a representation space of t/f s (s(;vC). We say that v G V 
is a weight vector with weight A G A^ if 



q a ';0 
I 1 



(A, a;) 
V 



where on the right we have the ^-binomial coefficient (33) rewritten as a Laurent 
polynomial and specialized to e . We denote the subspace of vectors in V with weight 
A by V A and call it the A-weight space. If V = ©AeA„ V A the righthand side is called 
the weight space decomposition of V . 

Remark 17.22 Notice that we are using V\ to represent the irreducible representation 
with heighest weight A and V A to denote the A-weight space of a representation V. 
Generally we will use calagraphic fonts to denote representations of quantum groups 
and roman fonts to denote representations of classical algebras. 

By (34) we have 

q a J(e ( j n '>v) = (q a Je<j n) q- a J)q a Jv = q naji q^ X ' ai) v\q= e 

_ e «(a,,a,)+(A,a,) _ e (\+nai,aj) v 

As we explained this in itself does not mean that e\ n \ has the weight A + naj but one 
can show using the full list of relations in Lusztig [100] or Chad and Pressley [40] that 
indeed 

e W(y x ) C v x+nai , fi H \V x ) C y x ~ na \ 
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With this notation we have a very important result that follows from the definitions by a 
deformation argument [40] : 

Weight space decompositions of V\, V?, are the same, that is, then- 
weights and the dimensions of their weight spaces are equal. 

Since weight space decompositions of classical representations are well-known this 
observation comes handy when performing computations with representations of 
quantum groups. 

Circumventing the lack of complete reducibility is not as simple. Ultimately, we will 
have to restrict to the class of admissible representations for which complete reducibility 
still holds. This will lead to the desired finite number of irreducibles. To proceed in this 
direction we need to introduce the notions of the dual of a representation and of the 
tensor product of representations. Recall that given vector spaces U, V with a linear 
action of a Lie group G the actions on V* and U (g) V are given by 

gf(v) -=f{g~ l v), g(u ® v) := gu <g> gv. 

Quantum groups Uqisl^C), L^ es (s[^C) are neither Lie groups nor Lie algebras but 
Hopf algebras so the inverse and the tensor action are replaced by the antipode and the 
coproduct respectively. 

Definition 17.23 Given representations A — > End(^) , A — > End( V) of a Hopf algebra 
A (see Definition 17.2) the dual representation on V* is given by af(v) := f(j(a)v) 
and the tensor representation on U (g) V is given by a(u (g> v) := A(a)(u ® v). The unit 
representation ¥ := A —* End(C) is given by az '■= e{a)z- 

This definition works because 7, A are an antihomomorphism and a homomorphism 
respectively. For Lie group algebras and universal enveloping algebras with their usual 
Hopf structure we recover the standard definitions of the unit (trivial representation) 
dual and the tensor product. This provides the unit, dual and tensor product in the 
modular category that we are defining. 

Recall from Appendix D that for classical Lie algebras all irreducible representations 
are indexed by dominant weights. Since obviously the dual to an irreducible is an 
irreducible we get a duality involution on the set of dominant weights A+. For sI#C 
it can be easily described explicitly (see Fulton and Harris [62] or Humphreys [79]). 
Let wq= ( x N _ x ' ^2 1 1 ) be the order-reversing permutation then ~ V_ Wo (A) and 
this is an involution because Wq = id. 

Exercise 17.24 Check that this indeed works for a couple of representations of SI3C. 
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More importantly for us, this carries over to the quantum groups, in particular, 
(40) Vf (st N C) ~ V q _ Wo{X) (sl N C). 

The next definition introduces the type of representations that we will use to build our 
modular category. These 'tilting' modules were originally introduced and studied in 
the context of algebraic groups. In fact many of the proofs refer to facts that are true 
by analogy with results from algebraic groups, and we do not know of a reference that 
addresses the representation theory of quantum groups that we need without assuming 
familiarity with algebraic groups. For example Chari and Pressley refer to algebraic 
groups as the 'classical' case [40]. See HH Andersen [9] for more history and some 
important results related to these modules. 

Definition 17.25 (Tilting modules) A representation V of ^ es (sl^C) is said to 
have a Weyl filtration if there exists an increasing sequence of invariant subspaces 
= Vq c Vi C • • • C V r = V with Vi/Vi-i ~ W e x .(sl N C) for some dominant weights 
A, E A+ . A representation V is called a tilting module if both it and its dual V* have 
Weyl nitrations. The category of all tilting modules is denoted by Tilt e (5i^C) 

Note that if we had complete reducibility, the existence of the Weyl filtration is equivalent 
to V being a direct sum of W x -s. However, such direct decomposition does not hold 
in general, that is, the filtration 'tilts'. In particular, the tilting modules still are not 
completely reducible and we need a condition weaker than irreducibility to describe the 
'elementary' tilting modules. 

Definition 17.26 A representation V is called indecomposable if it does not split into 
a direct sum of two proper invariant subspaces V ^ V\ © V% ■ 

It follows from Example 17.20 that in the s^C case every Weyl module is indecompos- 
able. Also every Weyl module obviously has a trivial Weyl filtration. However, not 
every Weyl module is tilting. The problem is that its dual does not necessarily have a 
Weyl filtration. This is closely related to the fact that some Weyl modules are reducible. 
The following exercise provides a good example to think about when studying these 
issues. 

Exercise 17.27 Take N = 2 and & = 3 giving / = 10 and e = e ni/5 . Show that 
the subspace of Wf^staC) generated by/( 3) wo and/ (4) Mo is an invariant subspace 
([5] e = 0) so that W| Wi (sl2C) is reducible. Use this invariant subspace to explicitly 
construct a non-split extension 

-> W^OfeC) - Qf wi -> W^OfeQ - . 
The module Qj UJ] is the unique indecomposable tilting module of weight Iloi . 
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Nonetheless, we have the following major theorem originally due to H H Andersen [9] 
for / odd. The case of even / should follow the general arguments of [9] but it is not 
stated explicitly there. 

Theorem 17.28 Direct sums and summands, duals and tensor products of tilting 
modules are again tilting modules. For every X £ A+ there is a unique indecomposable 
tilting module Q e x with the highest weight A and one-dimensional highest weight space. 
Moreover, every tilting module V admits a decomposition 



with multiplicities m\(V) canonically determined by V and only finitely many of them 
non-zero. The dual is given by Q\* = SI ^ . 

Thus, tilting modules form a subcategory Tilt ^sInC) of the category of representations 
of L^ es (s[jv<C) with morphisms being equi variant maps/(av) = af(y), which is closed 
under duality and tensor products and is dominated by indecomposable objects Q x . 
In a way, the tilting modules resemble the classical representations much more than 
general representations of t/^ es (s[yv<C) do. However, there are two difficulties that 
prevent TiK e (stj\fC) from being modular as is: Q e x are not exactly simple objects being 
indecomposable but not irreducible and there are still infinitely many of them. Based 
on ideas from physics or more likely ideas from algebraic groups Andersen was able to 
resolve both problems by discarding tilting modules of quantum dimension 0. This is 
explained in the next subsection. 

17.3 Quantum dimensions and the Weyl alcove 

We do not have a ribbon structure on Tift e (s[jvC) yet, so defining quantum traces and 
dimensions as it was done in Section 16.4 is not possible. Recall however that quantum 
traces were reinterpreted in Section 16.6 in terms of double duals. Namely, given a 
double dual isomorphism V V** one can set Ttqif) := Hy* o (Sy of <gi idy*) o Uy 
and dim q (V) := Tr^idy), where Ply*, Uy are the standard pairing and copairing for 
vector spaces. Any candidate for 5y must of course be equivariant under the U r e es action. 
The standard identification of V and V** in the category of finite-dimensional vector 
spaces v [v] with [v](^>) := ip(y) is not equivariant. Indeed, we have 

(a[v])(p) = [v](7(a)v?) = ( 7 (fl)^)(v) = ^\a)v) = ( 7 2 (a)[v])(^) 

and one easily sees from (32) that 7 / id. The idea is to 'fix' [•] to make it equivariant. 
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Proposition 17.29 Let p := \ J2aeA+ a ^ e tne Weyi weight (see Appendix D). Then 
for a in Uqisl^C) or in U^ s (sInC) we have 

(41) 7 2 (a) = q 2p aq- 2p 

and the double dual map 5y '■ V — > V** given by v m [g 2p v] is an equivariant 
isomorphism. Consequently, 

(42) Tr ? (f) = Trfo 2 '/), 
where Tr is the usuai trace. 



Proof Since both sides of (41) are algebra homomorphisms it suffices to check the 
equality on the generators q ±a %e\J{. We have 7 2 (<7 ±a ') = q Ta ' by (32) and the 
equality is obvious. For e k we have from (31) and (32): 

l\e k ) = 7 (-^- qV *) = _ 7 (^-« v *) 7te ) = q <* V *e k q- aV * = q 2 e k . 

On the other hand q 2p e k q~ 2p = q^ ak ' p ^e k = q 2 e k = j 2 (e k ) as claimed. (Recall from 
Appendix D that p = ^Uj.) In Uqisln/C) the equality for the divided powers ejj. 
follows by dividing both sides by [n\ q \. The case of fjf 1 ' is analogous. Since (41) does 
not involve any rational functions of q it remains valid in _ij(sIjvC) and therefore 
specializes to t/^ es (slArC). The equivariance is now straightforward 

a8 v (v) = a[q 2p v] = [ 7 2 (a)<? 2p v] = [q 2p av] = S v (av). 

Finally, for the quantum trace we get 

Tr ? (f) := rv o (Sv of ® id y «) (^2 n ® v*J = rv (j>2[q 2p f(v k )] ® v 4 




This completes the proof. □ 

Remark 17.30 Something interesting is happening here. We are starting with a 
representation of a deformation of the universal enveloping algebra of sIa?. In order to 
define the appropriate deformation we introduced elements such as in place of the 
coroots. Furthermore, because we are exponentiating {q = e 2 ""'/' after specialization 
giving qP = g 27 "^/') these are actually elements of the group SU(/V) so the characteristic 
numbers that we compute here can be expressed in terms of characters of the group 
SU(/V). The Weyl character formula can then be used to compute the resulting 
characters. 
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Recall from the previous article that the Weyl modules W|(sl#C) have the same weight 
space decompositions as the classical representation spaces V\ . The quantum dimension 
is the quantum trace of the identity, and it is given by the Weyl character formula (see 
Appendix D). 

Corollary 17.31 The quantum dimensions of the space W\ is given by 

n (A+p,a) _ -(A+p,Q) 

aeA+ 

Exercise 17.32 Since e~ 1 = e it is obvious that dim^W^ is real. Prove that in fact 
dim^VV^ > if e is a primitive root of unity. 

It is obvious from (43) that dim^lA^ = if and only if (A + p, a) is divisible by /' for 
some a G A + . Therefore, for A in the range < (A + p, a) < 1' for all positive roots 
a the quantum dimension of is non-zero. 

Example 17.33 For the Weyl modules W ( e m _ 1)w (s[ 2 C) from Example 17.20 the Weyl 
weight p := a/2, where a is the only positive (and also simple) root of s^C Therefore 

(im — 1)uj + ^,a) = m — 1 + ^(a, a) = m 

and dim^W^j^s^C) = if and only if /' divides m. The condition < (A+p, a) < 
I' is in turn equivalent to < m < I' . Note that by what we mentioned in Example 17.20 
when m = I' the Weyl module is still irreducible even though its quantum dimension is 
already 0. 

It turns out that the Weyl modules with the highest weights in the range < (A+p, a) < /' 
are the most important ones for our purposes. 

Definition 17.34 (Weyl alcove) The (open) Weyl alcove is a subset of the Cartan 
subalgebra f) of sl^C defined by 

C l := {x G I) | < (jc + p, a)< 1, for all a G A+}, 

where I' = I for I odd and = 1/2 for / even. We also denote the set of dominant 
weights in the Weyl alcove by A l w := C fl A w and call this the Weyl alcove when there 
is no confusion. The closure c' is called the closed Weyl alcove. We will often abuse 
notation by saying that a Weyl module is or is not in the Weyl alcove according to the 
location of its highest weight. 
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We already saw that the modules in the alcove have non-zero quantum dimensions. In 
fact, much more is true (see Andersen [9], Andersen and Paradowski [10], Chari and 
Pressley [40], and Sawin [136]): 

Theorem 17.35 The Weyl alcove A l w contains precisely all the (highest weights of) 
Weyl modules W x that are both irreducible and have non-zero quantum dimensions. 
An indecomposable module Q e x (see Theorem 1 7.28) has non-zero quantum dimension 
if and only if A G A l w in which case Q x = W e x .If\£ A' w then A* := -w (A) G A^. 

Recall that Q e x ~ Q x , so the Weyl alcove is closed under duality. In practice for sI^C, 
it is more convenient to write the defining condition of the alcove as follows 

{0 < (x + p, a,-), f° r all simple roots a,- 
(x + p,9) < I , for the highest positive root 6. 

We proceed with some sample computations that make (44) more explicit. 

Example 17.36 Recall from Appendix D that L, := En > — ^Ii = 1, . . . , ./V — 1 form 
a basis in h = h* of bInC. Given a weight it is convenient to write A := Xw=i ^i^i 
by setting Aa? := 0. The highest root is 6 = En — E^n = Ylf^ a i wrtri a i = 
En — Ei + \ i + \ = Li — Li + \ . One easily checks that (Ly, ai) = 5ij — 5i+\j and since 
p = Y^i=l u i we nave 

N-l N N-l N-l 

i=l j=l i=\ y=l 

N-l N N-l N-l 

i=l j=l i=l ;=1 

N-l 

i=l 

= \ X +N - 1. 

Hence the second condition in (44) is equivalent to Ai < I' — N. By analogous 
computation the first condition reduces to A,- > A/+i for all i. This means that A; is a 
partition and the weights A from the alcove are always dominant: C A+ . Thus (44) 
reduces to A being dominant and < \j < I' — N for all i. Note that in the case of 
Chern-Simons theory where / = 2(k + N) this leads to an A^-independent condition 
< Aj < k. Another direct computation shows that A* = X^i( A i — Aat_,- + i)L ; -. 
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Exercise 17.37 Show that in the case of sfeC we have A* = A, that is, all Weyl 
modules are self-dual. 

Exercise 17.38 The symmetric group W = acts on En by aEu = £ CT (!>(0 • 
Compute the induced action on L, : 



L a(ih a(i) < N 



and use it to derive a formula for A* = — wo(A). 

Exercise 17.39 Recall from Appendix D that one can also express any weight as a 
sum of fundamental weights A = Y2i=i n i UJ i an d > for the dominant weights. 
Show that (44) is equivalent to n\ > and Y2i=i <l' —N. Also in terms of r,s we 
have A* = *£?=i m-M 

Exercise 17.40 In the Cartan subalgebra fj of sl^C neither a, nor ujj form an 
orthonormal basis. For s^C orthonormalize a\, ai into u\, ui and show that 

o.\ = V2ui, ct2 = V / 2(— \u\ + ^1*2)5 ^ = Oi\ + 0C2 = V^(j«i + -^^2)- 

Given x = x\U\ + X2U2 describe the condition x G C in terms of jci,X2 and draw a 
picture of C l (cf. Figure 18.3 that shows p + C'). 

The properties of the Weyl alcove indicate how to proceed with the construction of a 
modular category. First of all, we finally have a finite set A' w of simple objects and they 
are indeed simple because alcove Weyl modules are irreducible. To make them dominate 
our category it suffices to consider only tilting modules that decompose into direct sums 
of alcove Weyl modules. Since dual alcove modules are still in the alcove the duals to 
such tilting modules will again decompose into direct sums of the alcove Weyl modules. 
This idyllic picture is spoiled by the behavior of the tensor product: the tensor product 
of two alcove modules may have non-alcove modules in its decomposition. This means 
that if we want a category with 'tensor products' the usual tensor product has to be 
redefined. The idea is to discard the submodule of the tensor product that has quantum 
dimension zero. Since Q e x has positive dimension if and only if it is an alcove Weyl 
module keeping only the latter should do the trick. 

Definition 17.41 Let V = © AgA + (Q\) be a decomposition of a tilting module 

V into the indecomposables as given by Theorem 17.28. Then its reduction is defined 
by keeping only summands with highest weight in the Weyl alcove, that is, 



V.= ® XeK (Q 



A J 
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A tilting module is said to be reduced if V = V and negligible if V = . The reduced 
tensor product of two reduced modules U, V is defined by 

U®V := TTWV. 

Remark 17.42 Actually, we only defined V up to isomorphism. For a strict category 
one needs a canonical construction of it which goes as follows (see Chad and Pressley 
[40]). Let V be the maximal reduced submodule of V and V' its maximal negligible 
submodule then V = V/(V n V) is the canonical representative of the reduction. In 
physics literature reduced tensor product is often called fusion tensor product and rules 
for computing it are called fusion rules. 

The following theorem also due to Andersen [9] (see also [40]) indicates that the new 
tensor product behaves 'properly'. 

Theorem 17.43 If V is any tilting module and U is negligible then so are U V and 
V (g) U . Thus, if U, V, W are reduced we have canonical isomorphisms U® V ~ V®U 
and (U®V)®W ~ U®(V®W) 

f 

Exercise 17.44 A morphism U — > V of tilting modules is called neglible if it factors 
through a neglible tilting module. An equivalence class of equivariant morphisms 
is called a non-negligible morphism. Prove that / is neglible if and only if for any 

g 

morphism V — > U we have Tr q (fg) = Tr q (gf) = 0. 

Remark 17.45 Note that we do not lose any information about link invariants by 
discarding tilting modules of quantum dimension zero. It follows from Definition 16.29 
that the colored invariant Jv u ...,v k (L) is if any one of V,- is negligible. Indeed, we can 
always elongate the strand colored by V, so that it cups under and caps over the rest of the 
link. The entire link then reduces to a morphism Vi ^ Vj with Jv u ... i v k (L) = Tr q (fi). 
But fi is negligible along with Vj and so its quantum trace is zero. This serves as 
topological justification for discarding the negligible modules. 

Definition 17.46 We denote the category of reduced tilting modules with equivariant 
linear maps by TiK e (stjvC). 

Remark 17.47 There is an alternative construction of this category used eg in Bakalov- 
Kirillov [21]. One keeps the ordinary tensor product but takes morphisms to be 
equivalence classes of equivariant linear maps modulo negligible ones (so called 
non-negligible morphisms). The category so defined turns out to be isomorphic to 
TUi e (sl N Q, see Chari-Pressley [40]. 
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The category Tilt^si^C) together with the duality and reduced tensor products is 
exactly what we were looking for. However, two key ingredients are still missing: the 
braiding and the twist. The latter can be restored from the former using dual cups and 
caps as in Section 16.6. But to get a braiding we need to introduce a new structure 
on L^ es (s[;vC) known as quasitriangular structure or the R -matrix. After defining the 
braiding and twist we will see that for even roots of unity Tilt e (sIjvC) will turn out to 
be modular. 



17.4 R -matrices and braiding 

We are making good progress defining a concrete nontrivial modular tensor category. 
By considering appropriate representations of t/f s (s[atC) as representations of an 
associative algebra, we were able to construct the objects and morphisms of an abelian 
category Tilt € dominated by a finite collection of simple objects. The Hopf algebra 
structure of t/^ es (slArC), in particular the counit, the antipode and the coproduct were 
responsible for providing additional structures in the category of representations, namely 
the unit object, the duality and the tensor product. In this article we define the appropriate 
braiding in our category. 

The braiding requires something beyond the Hopf algebra structure. As motivation, recall 
that the (trivial) braiding on REPg from Example 16.6 is just the flip Xj/y(M(g>v) = v®u. 
Of course, given any Hopf algebra A we can define a braiding a on A ® A by 
a % b i— ► b a and this will induce a trivial 'braiding' on any pair of representations. 
The catch is to find a necessarily nontrivial braiding compatible with the rest of the 
structure - in particular leading to the same definitions of quantum traces and dimensions 
as in the previous article. Drinfeld showed that for a finite-dimensional A the following 
notion works [51] (see also Chari-Pressley [40] and Majid [102]). 

Definition 17.48 A Hopf algebra A is called quasitriangular if there exists an invertible 
element R (called the universal R -matrix) living in a certain completion A® A of the 
tensor square A <g> A that satisfies the following axioms 



(id <g> A)(R) = (a ® id)(l ® R) ■ (R ® 1). 
Here a(x (g> y) := y (g> x. If A is finite-dimensional one can take A<S)A = A (8> A. 



(45) 



(A <g> id)(R) 



(TO A 



RAR-\ 

(a (8) id)(l (8) R) -(1®R) 
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When a universal R -matrix exists it is essentially unique. Given a pair of representations 
pu, pv- A —> End(£/), End(V) the universal R -matrix induces specializations 

R UjV : U <g> V U <g> V; R U: v(x <8> y) := (pj/ ® pv)(x y) ■ 
and the braiding will be composition with the flip x uv :=aoR uv . 

Remark 17.49 We have not defined the completed tensor product A® A because we 
will not use it. We will express the universal R -matrix as an infinite formal sum and 
it will not matter what space it lives in. The point is that when it is applied to any 
product of reduced tilting modules it will reduce to a finite sum. Infinite-dimensional 
Hopf algebras such as f/ 9 (s[«C), L^ es (s[,/vC) rarely contain a universal R -matrix in 
A <8> A. For our examples one really has to go out of one's way to construct a big 
enough extension A® A to contain R. In other words we are treating the universal 
R -matrix as the Cheshire cat whose smile lingers on after the cat is gone. After all, 
tilting modules are finite-dimensional and all we need are the induced R -matrices on 
finite-dimensional representations (the smile). 

Aside 17.50 There is a standard way to construct a matrix that satisfies the conditions from 
the definition of a quasitriangular Hopf algebra. The opposite algebra of a Hopf algebra A is the 
algebra A op with multiplication given by a - op b = b ■ a. One first constructs an object living in 
A op (g> A* satisfying the correct relations and then takes the image of this object in the correct 
completed tensor square. If A is any Hopf algebra let {a,} be a basis, {a 1 } be the dual basis and 
set R = a k <S> a k . We will see that this satisfies the required property in A op A* . Write the 
comultiplication in A in index notation as A(a^) = ^ 7^a,- ® aj . This determines the product 
in A* by 

(a'ai)(a k ) = (a 1 ® fl'XAfe)) = 7 f , 

so a'a' — 7^a* . The product in A op is given by a - op b = ba and the comultiplication is the 
same A. Now compute 

(A <g) id)(R) = 1k a i ® a i ® ak 

= ^ $g m ai ® a m <g> 7|?a* 

= ^2 a > ® a i ® a ' a ' = (°" ® i' 1 )^ 1 <8> ) • (1 <8> 

The proof that the -matrix that we are about to define satisfies the same equation is that there 
is a homomorphism taking the R -matrix constructed here to the complicated expression given 
below. This idea is the heart of the quantum double construction. Working out the details 
takes from page 128 to 273 of Chari and Pressley [40]. Drinfeld won a Fields medal in part 
for the work formalizing the idea of the quantum double. The quantized enveloping algebras 
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Uq(slf{C), U^islftC) are not quantum doubles themselves but they are quotients of such and 
the R -matrices can be computed by 'projection' [40], see also Rosso [132]. Roughly speaking, 
if t/+ ( U~ ) denote the subalgebras generated by q ±ai ,ei(q ±ai ,/]•) then U~ ~ £/+* and U q is 
a quotient of D(U^) ~ [/+ <E> U~ . Note that f/+(t/~) are quantum deformations of algebras 
of the upper (lower) triangular matrices hence the name quasitriangular. 

For U q (sl2C) a formal computation following the previous aside yields [40, 102]: 

(46) R = q a ®" Y q n(n-D/2 W g J e n ^fi 

n=0 * 

where the second term acts on x ® y in the obvious way and 

when x,j are weight vectors with weights A, \x respectively. As before a is the simple 
root of s^C and ui = a/2 is the corresponding fundamental weight. This formula is to 
be understood as follows: on any finite-dimensional representation e n ,f" act nilpotently 
and only finite number of terms in the sum (46) are non-zero. Thus the induced matrices 
Ru,v are well-defined for any pair of finite-dimensional representations even though the 
'universal /?' itself is just a formal expression. 

Example 17.51 Recall from Example 17.20 that the action of {/^(s^C) on the 
representation V^^^s^C) spanned by vi, . . . ,v m _i is given by 

(47) ,-v,. = ,^v, eV^^Mv^, /V= [Wl =^v f+n . 

Obviously e" acts as on v,- for n > i and /" acts as on v,- for > m — 1 — i. 
Therefore the sum in (46) applied to v; <g> v ; - truncates at min{/, m — 1 — j} . By (47) v,- 
is a weight vector with the weight (m — 2i)u> and therefore 

(a,(m— 2i+2n)uj)(u! ,(m— 2j— 2n)uj) 



q^iVi-n ® V j+n ) = q K«Arn-^n mK m-,j-,nw v ._ n g, Vj+n 

= q ^2i+2nXm-2j-2n) Vi _ n ^ Vj+n _ 

Substituting this into (46) we explicitly get 

(48) R( Vi ®Vj) = 

min{j,m-l-;} _ 1 . 

V I((m-2,-+2«)(m-2/-2n)+»(n-l)) (g-g ) [m-l+n], ![/+/»],! 

^ ^ [n\ q \ [m-i] q \\j] q \ 
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<? 5 vi (8) vi + q i(q-q ) — ^j-^ v <8 v 2 



To make this more transparent we will compute this expression for m = 3 and some 
pairs ij. 

2 

#(vo <8 vo) = <? 2 vo (8) vo 

«(vi ® vi) = q^- 2) °- 2) Vl ® vj + 9 i<3-2+2X3-2-2) (9 - 9 - 1 )[3],[2] ? v ® v 2 

,(g 3 -g- 3 )(g 2 - g- 2 ). 

(^r-^r- 1 ) 2 

= gzvi (8 vi + <?~2(<7 3 - <7~ 3 )(<7 + <? _1 )vo <8 v 2 
#(v ® v 2 ) = <72 3(3_4) v (8 v 2 = ^"2v (8 v 2 . 

Exercise 17.52 Compute R(v t (8 v,) for m = 2, that is, for V = V<2(sl 2 C). Show that 
in the lexicographic basis vo <8 vo, vo <8 vi , vi <8 vo, vi (8 vi of V (8 V the /?y,v xs given 
by the matrix 

(q 1 0^ 
1 4 2 -4~ 2 
1 
\0 q 2 J 

Remark 17.53 A reader may notice that expressions in Example 17.51 involve 
fractional powers of q that do not belong to C(q) that is technically our field of 
coefficients. In fact this is a direct consequence of having the term q a ® w in (46) 
that produces (u>, fi)th power of q when acting on u (8 v with v having weight (j,. 
Since w = f this number is potentially a half-integer. In general, the analogous term 
produces det | a ..) powers of q, where ay is the Cartan matrix. For sInC an elementary 
computation shows that det(a ; y) = N and for the R -matrix formula to make sense 
we have to extend the coefficients to C(q l l N ). Formally, we now have to work in 
U q \/N{s{^C) := t/ 9 (s[#C) ®c(q) Ciq 1 ^) instead of J7 (s[jv<C). If we apply the same 
process to U i/n(sInC) that was used in Definition 17.12 to obtain f/^ es (sljvC) from 
UJsInC) we will get the same algebra assuming that q l l N is specialized to the principal 
value of eN ■= e N(k+N) This allows us to continue doing all computations with the 
indeterminate q as explained in Section 17.2, rewrite the results in terms of the divided 
powers and Laurent polynomials and only then specialize to e . 

The formula (46) can not be specialized to L^ es (s[ 2 C) directly because of the [n] q l in 
the denominator. This is easy to fix following our general ideology by noticing that 

= [n] q \ & 
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Thus for L^ es (sbC) formula (46) is replaced by 

oo 

(49) R = q**®" e n{n ~ l)/ \e - e - 1 ) n [n] e ! e (n) ®f {n) . 

The inconvenience of computing the action of the divided powers for this formula can be 
bypassed as follows. Recall that we are interested in the Weyl modules VVL ^(s^C) 
where m lies in the range 1 < m < (k+N) (the Weyl alcove). In this range r 7, still 
makes sense and (46) with q = e can be used equivalently. 

The appearance of fractional powers is not the only nuisance we have to deal with when 
moving on to U i/n(sInC). To write the R -matrix for N > 2 we need analogs of the 
root vectors e,-,/; for non-simple positive roots a. Unlike the classical case there is no 
canonical way to introduce such. A correct generalization comes from the following 
classical observation [40]. Let W(= (25 at) be the Weyl group of sl^C (=the symmetric 
group) generated by reflections Si = s a . in the hyperplanes orthogonal to the simple 
roots a.\. In the basis these act as the transpositions s& = (k k + 1). It is clear from 
this description that the Weyl group acts transitively on the root vectors e,y := En — Ejj. 

More explicitly, each element in the Weyl group admits a presentation a = . . . Sj v 
as a 'word' in generators. A word representing an element is called reduced if it has 
the shortest possible length v (such a word may not be unique). Let wo £ W be 
the order-reversing permutation that we already met in connection with duality and 
wo = Sjj . . . S[ v be a reduced word for it. Then each positive root occurs exactly once in 
the following sequence 

(50) Pi := a iv Pi := s h {a h ), ...,/?„:= s h . . . Ji l/ _ 1 (a I -„). 

This gives a natural enumeration of the set of all positive roots. The standard choice of 
a reduced word for 5(^C is 

(51) w = ^i(^i)fe^i) • • • Oat-i • • • s 2 si) 

and it gives the anti-lexicographic (read from right to left) enumeration for the roots, 
namely 

P\ = E\i — E22, Pi = E\\ — £33, pj = E22 — £33; 

(52) p 4 = E n -E44,..., p 6 = £33 - £44, . . . , 

Pv-N+2 = £ll — £AW, • • • , Pv = Efj-lN-l — £aW- 

Exercise 17.54 Verify the last claim. 



Qeometry & Topology Monographs 8 (2006) 



366 



Dave Auckly and Sergiy Koshkin 



We now consider how to define standard root vectors in the quantum setting. For this 
we need analogs of reflections Si for quantum groups. They are given by the Lusztig 
automorphisms [40, 100]. 

Definition 17.55 Define algebra automorphisms Tj : U i/n(sInC) — ► U i/n(sInC) by 
the following action on the generators 



i 


=j 


\i ~j\ 


> 2 


j 


= i±l 


T t q ±ai 


= q^ a ' 


T iq ±a > 


= q ±aj , 


Tiq a > 


= q a 'q a \ 




= -f<q a - 




= e h 


Tiej 


— q Vftji 


m 


= -q a 'ei 








= qfifj -fjfi- 



(53) 



For a € W = &n represented by the reduced word a = ... Si v set T a := 7/, . . . Ti v . 
The operators T, = T s . , T a are called the Lusztig automorphisms. 

Note that Tj and T a are only algebra, not Hopf algebra automorphisms (they do not 
preserve the coproduct). For general permutations T a are well-defined due to the 
following result of G Lusztig [100]. 

Theorem 17.56 The action of T a depends only on a and not on the choice of a 
reduced expression for it. All T a are invertible and E/^ w (s[yC) is invariant under all 
of them. Furthermore, the following relations hold 

T i T j = T j T i , |/-j|>l 

(54) 1 1 

Relations (54) are nothing other than the defining relations for the braid group Bn on N 
strands. In this group multiplication is concatenation of braids and inverse is the mirror 
image. The ? ; generator of the braid group corresponds to a simple crossing between 
the ith and (/+l)st strands [40, 102]. Theorem 17.56 implies in particular that i— ► T\ 
defines a representation of the braid group in U i/n(sInC). Also note that the forgetful 
map Bn — > &n that only keeps track of the permutation on the strands is an infinite 
cover of the Weyl group of sIatC. 

We now introduce the quantized versions of the root vectors corresponding to non-simple 
roots. 

Definition 17.57 Let a G A + be a positive root and a = flu from (50). Then set 

(55) e^:=T h ...T ik _M: } ), f^ ■= T h . . . ^tff), 

where i\, . . . , 4 are as in the reduced expression (51) for wq. Naturally, we denote 



c a >Ja - — Jet • 
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Exercise 17.58 Show that if a = a* is simple then = e\ n) , and = ff"> as 
expected. 

With this notation we can now write down formulas for the R -matrices for U i/n(sInC) 
and U"* /lf (sl N C) that generalize (46) and (49): 



»l,...,n„=0 agA+ 



(56) 



n\,...,n v =<S aeA+ 

The product in (56) is not commutative and should be computed 'in reverse' to (52), 
that is, /3 V , . . . , /3i so that /?i term is applied to the tensor product first (hence the «— ). 
The 7? -matrix for ^ es (5[^C) is obtained from the second line in (56) by replacing q 
with e [40]. 



Example 17.59 In s^C the simple roots are ai = E\\ — E\\ and a 2 = E\ 2 — £33 and 
the only other positive root is a\ + a 2 = E\ x — £33 . In the ordering of (52) we have 
A = a u (3 2 = a\ + a 2 ,/?3 = «2- Hence ef x = e ( "\e ( £ = e 2 n) and e ( f 2 = Ti(e { 2 n) ). 
For ?i = 1 we get from (53) 



T\(e 2 ) = q e 2 e\ — e\e 2 = — e\e% + q l e 2 e\ 
and because T\ is an automorphism 

(57) = Ti(e 2 ) 2 = (ei^l^ + <7~ 2 e 2 eie 2 ei) - ^(ei^i + e 2 e\e 2 ). 

We now want to rewrite this in terms of the divided powers and Laurent polynomials 
in accordance with the general strategy from Remark 17.53. Taking into account 
that q + q~ l = [2\ q we have by the quantum Serre relations from (31) e 2 e\e 2 = 



[2] 



9 

\r-(e\e\ + £1^2) an d therefore 



ei(e 2 eie 2 ) + q 2 {e 2 e\e 2 )e\ = ^r-{e\e\ + e\e\) + q 2 {e\e\ +e\e\)-^- 

[2] q +q [2] q +{L+q } [2] q 



22 22 

e \ e 2 1 „-2 e 2 e \ 



Substituting into (57) yields 



[2]q 1 [2] 



2 2 2 2 



+ q 7^— + q e\e 2 e\. 



Tl (4) = c -^+ q -l^l - q- l e 2 e\e 2l 

v m, mq H 1 



Qeometry & Topology Monographs 8 (2006) 



368 



Dave Auckly and Sergiy Koshkin 



and dividing both sides by [2] q 

/<Q\ (2) t I ( 2 )\ (2) (2) -1 (2) i -2 (2) (2) 

(58) e'^ = Tiiey) = e\ 'e\ '-q e 2 e\'e 2 + q e\'e\'. 
Analogously, 

(59) fg = r l( /f ) = q\fef - +/ 2 (2) / 1 (2) - 
Formulas for L^ es (s[^C) can now be obtained by setting q = e. 

Exercise 17.60 Prove by induction that 

e% = ± q - k e?e?et k) 

k=0 

and derive an analogous formula forfj^, see [100]. 

Example 17.61 We will now compute the action of the R -matrix of the fundamental 
representation in the SI3C case. Recall from Exercise 17.15 that the fundamental 
representation V^CsfeC) has a basis of weight vectors vi := uq, v 2 : = f\u^, v 3 :=f 2 v 2 
having weights u\ , u\ — ot\ and u\ —a\ — a 2 respectively. This specifies the action of 
the q ai . The remainder of the action is specified by fivj = Syvj+i and e,V/ = <5;+iyV/_i . 
Continuing with the computation from the previous example we have 

efo = ei, ep 2 = -e\e 2 + q~ l e 2 e\, = e 2 , 

ffo =/ij ffh = ~M\ + <Z/i/2, ffa =h- 

Since vi is annihilated by e\ and e 2 , only the qT.^^i factor of the R matrix acts on 
vectors of the form v\ x (see equation (56)). Since the weight of the fundamental 
representation is u\ = □ = — \Ej 2 ~~ 3^33 > ^ follows that 

R( Vl ® Vl ) = q (<*iVi)("i,"i) Vl g, Vl = q 2/3 Vl ® vi, 
R( Vl ® yjj) = 9 (aiM)(^,«i-«i) Vl ®v 2 = q~ l/3 Vi ® V 2 , 
R( Vl ® y 3 ) = 9 <ai,wi>(«i,wi-«i> Vl ® v 3 = <?" 1/3 Vi 8) V 3 . 

Now 

ep { v 2 = vi, e^v 2 = 0, e ft v 2 = 0, 

so the only terms that contribute to the R -matrix evaluated on vectors of the form v 2 <g> x 
will correspond to the root j3\ . We compute 

fp{V\ = V2, fp Y V2 = 0, efcV3 = 0. 
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It follows that 



R(v 2 <8> vi) = q^ a '^'(v2 ®v { +(q- q~ X )vi <g> v 2 ) 

_ ^<ai,wi-ai)<a;i,a;i)+<a2,a;i-Q;i)<W2,a)i) V2 g, yj 

+ (q - ^- 1 )^( a i^i>(^i. w i- Q! i>+( Q! 2,wi>(w2,wi-o!i> Vi (g, 

= <7" 1/3 v 2 ® vi + q~ 1/3 (q - tf _1 )vi (g> v 2 , 
R(y 2 (g) V2 ) = 9 <ai,wi-ai)<wi,a;i-ai)+<a2,wi-ai)<a^,a;i-ai) V2 V2 

= <? 2/3 V 2 (8> V 2 , 

/?(v 2 V3) = ^< a l' a; l _a l)^l' w l~ a l~ Q! 2) + ( Q! 2,^l-ai>(^2^i-ai-a2> V2 g, V3 

= <7~ 1/3 v 2 (8> v 3 . 



Now 



e^v 3 = 0, e ft v 3 

/ft Vi = -V 3 , //3 2 V 2 
//3 3 Vl = 0, / ft V 2 



-VI, e / 3 3 V 3 =V 2 , 
0, //3 2 V 3 = 0, 

V3, //3 3 V 3 = 0. 



It follows that 



/?(v3®vi) = ^ ai ® a, ''(v3<g>vi+(tf-4 _1 )vi<g>V3) 

_ (ai,a;i-ai-Q2)<a;i,a)i)+(a2,a;i-ai-a2><<^,u;i) V3( g )Vl 

+ (^-^- 1 )^ <ai ' CJl)<Wl ' CJ '- ai - a2) Vi0V3, 
= ^" 1//3 V 3 (g)Vi+^ _1/3 (^-^ _1 )Vi(g)V3, 

/?(V 3 0V 2 ) = ^ Ea,0a; '(V3®V 2 +(^-^" 1 )v 2 ®V3) 

= ^( a i' w l _a i _a 2>(^l,^l-ai> + {a2^l-ai-a2>{^2,^l-Ql> V3( g 1V2 

+ (9— ^ _1 )^( Q i' £i 'i _Q; i)( a; i ,wi-o; 1 -a2)+<a2,wi-ai>(w2,wi-a 1 -0:2}^^^ ^ 

= ^r" 1/3 V 3 (g)V 2 +^ _1/3 (^-^ _1 )v 2 (g)V3, 
/?(V3<8)V3) = ^( a i' w l- a i~ a 2>(^l,wi-ai-Q!2>+(a2,wi-Q!i-a2>(w2,wi-ai-a2> V3 V3 

= ^r 2/3 V 3 ®V 3 . 

Ordering the basis for V^, (sl 3 C) (SI3C) lexicographically we may write the matrix 
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for R as 



1 



(q-q- 1 ) 



1 



(q-q- 1 ) 



1 



R = q~ 



1/3 



1 



(q-q- 1 ) 



1 



1 



This last example can be generalized to the fundamental representation V^^sInC) for 



Exercise 17.62 In general the fundamental representation has a basis with highest 
weight vector vi and v 7+ i :=fjVj. Show that in this basis we have, u>i(ej) = -E/y+i, 



The answer to this exercise can be found on page 277 of [40]. It is important because 
it provides the link between the quantum invariants as we are defining them, and the 
THOMFLYP polynomial and skein theory as sketched in Appendix C. 

It follows from Exercise D.7 that the representation 



for UqisUC) has 126 nontrivial weight spaces, we know that the dimension of 



is 126. It follows that the R -matrix is a 15876 x 15876 matrix. As the reader can see, 
computing the R -matrices explicitly is no mean feat even for simple representations. 
The next exercise is one of the few remaining cases that can be worked out reasonably 
by hand. 

Exercise 17.63 Compute a basis for the representation 



all N. 





(sUC) 




(sUC) 



Compute the action of q°" , 



V*(sl 3 C). 
ef , fi , and the 7? -matrix. 
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Recall that when Uq(slpfC) acts on a vector space V, we can define an action on the 
dual V* by 

(ag)(v) := g(7(a)v). 

To see how this works, consider the action of [/^(sfeC) on the dual of the representation 

V*(s( 2 C). 

Example 17.64 Recall that the representation 

V*(sl 2 C) 

has a basis vi, v 2 - We denote the dual basis by v 1 , v 2 . Since (ev 1 )(vi)=v 1 (7(e)vi) = 
and (ev l )(v2) = v i ('y(e)v2) = —q, we have ev l = —qv 2 . Similarly, ev 2 = 0,/v 1 = 
and fv 2 = —q~ 1 v i . We can now see how the R -matrix acts on terms that include 
functionals such as u\ <g> uj\ in the s\% case. We have 

R( Vl <g) v 1 ) = q~ 1/2 vi V 1 
R(yi ® v 2 ) = q l/2 v\ <g> v 2 
R(v 2 ®v l ) = q l l 2 v 2 ®v x 

R(v 2 <8> v 2 ) = g~ 1/2 v 2 ® v 2 - q~ 3/2 (q - q~ l )v x ® v 1 . 

Recall that we can define link and framed link invariants from a ribbon category. The 
R -matrix is the main tool to build a ribbon category from the representations of a 
quantum group. In fact the category of type I representations of U i/n(sInC) becomes 
a ribbon category if we use the braiding given by x M> y = a o R\jy . We denote the 
(framed) link invariants resulting from this category by Wf N where A is a collection of 
representations (one for each component of the link). We next compute the invariant 
n (left Hopf link) from the definition. First compute the composition of the top three 
morphisms from Figure 16.2. We have 

1 3/2 1 I 

Vi ® v i— ► q ' V\ <g) v ^ qv <g> v\ i— > q 

v 2 ®v 2 ^ q 3 ^ 2 v 2 <8> v 2 h-> ^r 3 / 2 [^r _1 / 2 v 2 ® v 2 - q~ 3 ^ 2 (q - q" 1 ^ 1 (g> vj i-> q~ x . 

We also have vi (8> v 2 and v 2 v 1 0. Notice that this recreates the double 
dual isomorphism. To continue the computation of the invariant for the left Hopf 
link it is helpful to write out the matrix for the braiding in the lexicographic basis for 
V^(s[ 2 C) <g> V^(s[ 2 C) from Exercise 17.62. This is our first sample computation of a 
braiding. 
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Example 17.65 We have 



-q 



-1/2 



1 

1 (q-q~ l ) 



q. 



.-2 



,-2 



(q- l -q) 2 +\ (q- l -q) 
{q- l -q) 1 



q 



-2 



We can combine our computations of the twist, double dual pairing and braiding to 

y 



compute the invariant of the Hopf link. We use vf to denote v,- <S> v k and vjf to denote 



v; <S> Vj v <g) v . The computation is contained in the following example. 

Example 17.66 Following the element 1 up through the morphisms as depicted in 
Figure 16.2 gives 

1 i— ► v} + v 2 

h+ q- l v\\ + (q(q - q~ 1 ) 2 + q)vf 2 + (1 - q 2 )v 2 2 \ + (1 - q 2 )v\ 2 + qv\ 2 + q^v 22 
^v\+ ((q - q' 1 ) 2 + l)v} + q 2 v\ + q~ 2 v\ 

q + q(q - q~ 1 ) 2 + q + q + q~ 3 = q 3 + q + + g~ 3 . 



We conclude that 



W^ l2 D (left Hopf link) = q 3 +q + q~ i + q~ 



We can use this type of computation to derive an interesting recurrence relation for the 
framed invariants. Notice that the twist in the fundamental representation is given by 



It follows that 



:(v«) 



i(Vij) = q 



-1/N 



X a ,a(Vkt) = q 1/A W + (<?-<? l >u)i k > £. 



3 (va) 



q l/N q- l Vii 



X n,n(V;7) 
X ' ' (vft) 



q 



i <j 



q i/N (v ke -(q-q >«), k > I. 



Exercise 17.67 Use the previous computations to show that 

(q - g -1 )id n) n . 



a,n q 
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Notice that this implies that the framed link invariant W s[n (all representations taken 
to by fundamental) satisfies a skein relation. Also notice that the invariant defined via 
quantum groups is provably invariant under isotopies. A framed link invariant is very 
close to being a link invariant; it is invariant under Reidemeister moves two and three 
from Figure 13.1. It follows from Exercise 18.7 below that 

W sW (closure(<9| of)) = g^ 1 ^^ (closure^)) . 

Here closure is just the quantum trace in the category of framed tangles; see Figure 16.6. 
This motivates the following definition of the THOMFLYP polynomial. 

Definition 17.68 The THOMFLYP polynomial is defined by 

P(L) := ? (V")ESE*!%W r i» (L) . 

Here c(L) is the number of components of L and is the linking matrix of L. 

Remark 17.69 To define 3 -manifold invariants we have to go to the category or 
reduced tilting modules Tilt e {5i^C). Of course this is also a ribbon category that 
generates link invariants. The resulting invariants are just the evaluation of P(L) at 
qi/N _ e m/(N(k+N)) j n t ^j s wa y j s a f unc tj on f n and a primitive even root of unity 
e = e i™/( k + N ) _ One can show [40] that for each L there is a unique rational function in 
variables A 1 / 2 , q 1 / 2 denoted by V(L) such that P(L) = T(L){\ = e 2N , q = e 2 ). 

Exercise 17.70 From Exercise 18.4 we know that 

P(unknot) = ql/2 _ q _ 1/2 

Prove that the THOMFLYP polynomial satisfies the following skein relation 
A 1/2 P(closure(x M of)) - A~ 1/2 -P(closure(x r, 1 of)) = (q 1 / 2 - q- 1 / 2 )P(closure(/')). 

This (skein) recurrence relation together with the value of the unknot specifies the link 
invariant uniquely. In fact one can turn the entire process around and use the skein 
relation and normalization as the definition of the link invariants. It is not obvious that 
such a definition is well-formed. One must show that different ways of applying the 
skein relation lead to the same answers and one must show the resulting quantity is 
invariant under the Reidemeister moves. In addition to the approach to the definition 
that we used via quantum groups there are several different approaches to do this. The 
first are described in the original paper [60]. 

Much more is true - the full Reshetikhin-Turaev invariants of links in general 3- 
manifolds can be recovered from the skein theory. This is described in Appendix C. 

Exercise 17.71 Compute ^(Right Hopf link) from the definition and via the skein 
relation and reconcile the two answers. 
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18 Reshetikhin-Turaev invariants from quantum groups 

We are now in a position to combine all of the facts we derived about representations 
of quantum groups into a definition of a nontrivial strict modular category. This is the 
category of reduced tilting modules at even roots of unity based on the Lie algebra sl^C 
Historically, Reshetikhin and Turaev constructed their invariants directly [129, 130] and 
only later Turaev [152] streamlined their construction into a notion of modular category. 
In this section we will finally define these invariants. First we review the main features 
of Tilteislfi'C), then compute its characteristic numbers, the S -matrix and finally the 
Reshetikhin-Turaev invariant of S 3 , aka the Chern-Simons partition function. 

18.1 Modular category of reduced tilting modules at even roots of unity 

In this article we summarize the modular category structure on Tilt e (slNC) when 
e is a primitive even root of unity. This is the modular category that produces the 
Reshetikhin-Turaev (quantum) invariants of 3-manifolds along the lines of Section 16.7. 
After discussing the main ingredients we compute some of the characteristic numbers 

2tt> 

of this category (see Definition 16.23). It is assumed throughout that e = e i , where 
I is even. To draw a connection with the Chern-Simons theory set k := 1/2 — N for 
the level of the theory. Then I = 2(k + N), I' = k + N and e = e^ . One should 
keep in mind that there is no mathematical definition of Witten's path integral, hence no 
rigorous way to compare Witten's invariants to the Reshetikhin-Turaev ones and the 
above connection is merely conjectural. 

Objects The objects are reduced tilting modules V, that is, representations of 
C/£ es (s[#C) (Definition 17.12) that are finite direct sums of Weyl modules (Defini- 
tion 17.19) with the highest weights in the Weyl alcove A l w (see Definition 17.34 
or Index set below): V = AeA , {Wj)® mx{V) . 

Morphisms The morphisms are equivariant linear maps f(av) = a(f(v)) . 

Unit object The unit object is ¥ := C with the trivial representation structure 
az '■= e(a)z, where e is the counit (32) of Uf s (s[^C). 

Dual objects The dual objects are the vector space duals V* of tilting modules with 
the action given by the antipode (32) (aip)(v) := ip(j(a)z)- 

Pairing and copairing The pairing and copairing are the usual ones from the category 
of finite-dimensional vector spaces 

n v (<p <8> v) := (p(y), l> v (l):=^2v k (g)v k , 

k 
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where v^, v k are dual bases in V, V* respectively. 

Tensor product The tensor product is the reduced tensor product (g) from Defini- 
tion 17.23, that is, the maximal invariant subspace U®V of the ordinary tensor 
product U V that is a direct sum of Weyl modules with highest weights in the 
Weyl alcove. The action restricts from the action on the usual tensor product 

a(u (g) v) := A(a)(u ® v) on U (g) V. 

Braiding The braiding is x^y := a o Rjjy, where a(u (g) v) := v <g> u and Ru,v is 
the restriction of the 'universal R -matrix' given by (56) to the ordinary tensor 
product of U,V. 

Twist The twist is obtained from the braiding, duality and double duality as in 
Figure 16.8, explicitly 

9 V := (idy (g> Dy) o (Xy,y <g> idy*) o (idy <g) Uy) , 

where n y (v (g) <p) := cp(q 2p v). 

Index set The index set of simple objects / is the set of dominant weights (see 
Appendix D) in the open Weyl alcove 

/ : = A' w = {A g A+ | (A + p, a)< 1/2, for all a £ A+}. 

The unit object is indexed by A = 0. 

Index involution The index involution is given by A* := — wq(A) (see Example 17.36), 
where wo is the order-reversing permutation and the action of the symmetric group 
W = &n on weights is described in Appendix D. In particular, 0* = 0. The dual 
weight is the weight of the dual simple object: W^sl^C)* = W^*(s[jv<C). 

Simple objects The simple objects are the Weyl modules W|(sIatC) with highest 
weights in the Weyl alcove A G A l w (alcove Weyl modules). 

For convenience we list the salient points of the definition in Section 18.1. 

As we described in Section 16.6 one can define a dual pairing and copairing in any 
modular category. In Tift e (s[/vC) we have, n v (v (g> <p) := ip{q 2p v) and U y (l) := 
J2k yk ® Q 2Pv k, where p is the Weyl weight of sl^C (see Appendix D). 

This is not yet a strict category because equality signs in the axioms are only canonical 
isomorphisms. For example, V is not equal to V ® C. In truth, we should be talking 
about isomorphism classes of tilting modules rather than tilting modules themselves, or 
selecting canonical representatives of those classes. One faces the same nuance in the 
category of representations of classical Lie groups. Formally, some tweaking in the 
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Category 


Tilt e (sl N C), e := e 2 ™/ 1 , 1 even 


Objects 


Reduced tilting modules V = A6A ; (W^)® mA(y) 


Morphisms 


Equivariant linear maps f{au) = af(u) 


Unit object 


C with az ■= e(a)z 


Dual object 


Dual vector space with (atp)(v) := ip('y(a)z) 


Tensor product 


Reduced product with a(u (g) v) := A(a)(u (g) v) 


Pairing,copairing 


r\ v {ip tg> v) := ip(v), Uy(l) := £ t v k ® v k 


Braiding 


xu,v(u ® v) := a(R(u (g> v)) 


Twist 


9 V := (idy (£) Dy) (Xy,y <g> idy*) o (idy Uy) 


Index set 


Weyl alcove / = A l w 


Index involution 


\* : = -wo(A) 


Simple objects 


Alcove Weyl modules {W^I^a; 



Table 18.1: Modular category of reduced tilting modules at even roots of unity 

notions of morphisms and ribbon operations is required. It can be done in a standard 
way by the Mac Lane coherence theorem [101] but in practice one can safely ignore the 
difference. 

To see that the ingredients do indeed form a strict modular category, we will sketch 
proofs of some of the axioms (16.5). The tensor axioms (Axioms 1^1) essentially follow 
from Theorem 17.43. The braiding axioms (Axioms 5-7) follow from the defining 
properties (45) of the R -matrix. There is a subtlety here since (56) is only a formal 
expression but (45) holds if R is restricted to a pair of finite-dimensional representations 
[40]. 

Exercise 18.1 If R = Yl a ® b write Ru = ^ a ® 1 <8> b and R 2 3 = Yl 1 ® a ® b. 
Compute the left hand side and the right hand side of Axiom 6. Conclude that Axiom 6 
follows from 

(A ^ id)(R) = R l3 Ri3 ■ 
To verify Axiom 7 write R = Y a ® b and compute 

(g ®f) o x V:W (s ®t) = (g ®f)(j2(bt) ® (as)) 

= 5^(%(0) ^ (af(s)) = Xy/ jW , o (f® g)(s ® t). 

The twist axioms and the last duality axiom (Axioms 8, 9 and 12) can be verified by 
manipulating tangle diagrams. One uses the graphical definition of the twist from 
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Figure 16.8. For example Axiom 8 is verified in Figure 18.1. The remaining duality 
axioms (Axioms 10 and 11) are trivial computations. Axioms 13-16 follow from 
Andersen's Theorem 17.35 and the construction of Tift e (s^C). See Andersen [9], 
Andersen and Paradowski [10], Chari and Pressley [40] and Sawin [136] for more 
details. The only remaining axiom is 17, the non-degeneracy of the 5-matrix. It will 
be proved as a byproduct of computing the Chern-Simons partition function of S 3 in 
Section 18.2. 




Figure 18.1: Proof of axiom 8 



As we already mentioned the link between the Reshetikhin-Turaev and Witten-Chern- 
Simons invariants is only conjectural. In the absence of a formal definition for the latter 
we can simply identify them with the former for the category Tilt e (slNC). 

Definition 18.2 The Reshetikhin-Turaev invariant corresponding to U q {5i^C) at level 
lc is defined to be the invariant r from definition (16.35) arising from the category 
Tilte(slNC). It is denoted by t s1nC (M). The Chern-Simons partition function is a 
different name for the same thing. It is denoted by Z CS (M) := t s1nC (M). 

As a warm-up to the computation of Z cs (5 3 ) we compute the characteristic numbers 
(Definition 16.23) of Tilt t (slNC) here. We already computed the number d\ in (43), 
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but we would like to simplify it. Recall how this went. The morphism associated to an 
unknot labeled with A sends 1 to d\. Let {v,} and {v'} be bases for the representation 
V| and its dual respectively. The morphism is a composition of two morphisms and we 
see that 

1 ^ Yl Vi ® yi " ^v ! (A( ? 2 ")v ! ) = Tr(A(g 2p )) = X x(q 2p )- 

It follows that d\ = xa(<? 2p )- The Weyl character formula states that the characters 
are just Schur polynomials, which in turn are ratios of determinants. In this case the 
determinants are Vandermonde determinants. The appearance of group characters was 
explained in Remark 17.30. 

Since the simple objects are the Weyl modules and the Weyl modules are specializations 
of V? we have, 

(A+p,a) _ -(\+p,a) 

d x =dim q w{= n 

Substituting e = e k + N yields 

(ua) -(ua) e*+x^''-e k+N^' (win, a) 



Ap,a) _ e -(p,a) 



2i \k + N 

for any weight fi. For sl^C the positive roots are ay = En — Ejj with i <j; therefore, 
we explicitly get 

sin 1 — — 



(60) dx = n \ k ( + \ 

Exercise 18.3 Note that a,j = X^{=/ a k afi d p = X^i 1 u i- Use the fact that a,-, Uj 
are biorthogonal to derive that (p, ay) = j — i. 

Exercise 18.4 Show that d UJl = (e N — e~ N )/(e — e^ 1 ). This is the invariant of the 
unknot in the fundamental representation. 

In principal to compute other invariants it appears that we have to compute some braiding 
morphisms. Notice that when the dimension of a representation is n, the R -matrix 
will be a n 2 by n 2 matrix. Clearly it is not practical to write out many R -matrices. 
Fortunately, sometimes one can get away with knowledge of its action only on elements 
of a special form, namely u\ (g) v S U <8> V with u\ being the highest weight vector in 
U with the weight A G A+. This observation is due to V Turaev and H Wenzl [153] 
and will come handy for the computations of and . 
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Proposition 18.5 Let u\ (v\) be the highest weight vector ofU (V) and let v (u ) be 
arbitrary. Then 

(61) Ru,v{u\ ®v) = u\® q\, 

(62) Ru,v(u <8> v\) = q x u (g> v A + z<\, 

where z<\ is an element of U (g> V<\ and V<\ is the sum of weight subspaces of V 
with weights < A . 



Proof By definition of the highest weight we have e^u\ = for all /. We also get 
e^u\ = for all positive roots from (55). Hence the only term that survives in the sum 
of products in (56) is the one that corresponds to v = 0, ni, . . . ,n v = and the whole 
sum reduces to 1 (g) 1 . Therefore, we only have to compute g^*<=i at ® Ui (u\ ® v). Since 
both sides of (61) are linear in v it suffices to prove it for the case when v = is a 
weight vector with weight fi. We have 

9 E£i' a -®"-(u X (8) V = ^tl'^a-X^M)^ $ 

= 9 (E£i 1 (A,a»)w»,E£i 1 (A^,)a,) BA ^ = ^(A,^) MA $ ^ 

since a;,, ay are biorthogonal (see Appendix D). But by the definition of the weight 
<jA,m) v ^ _ q x Vfi an d we are done with the first formula. The proof of the second 
formula proceeds analogously but va is acted upon by f^' and f$9v\ is no longer zero. 
Instead, it has a weight lower than A except in the case when n = 0. Thus, up to lower 
weight vectors in V the sum of products in (56) again reduces to 1 ® 1 and q^i=i a '® w i 
reduces as above to q x acting onu. □ 

Exercise 18.6 Derive formulas analogous to (61) and (62) for the case when u\, vx 
are the lowest weight vectors. 

Now we compute pf := Tr q (9^ 1 ). Since the twist commutes with all morphisms (by 
Axiom 9) and is irreducible for A in the Weyl alcove (Theorem 17.35) by the Schur 
lemma 9^ 1 must be a scalar operator and it suffices to compute it on a single vector in 
W\. In view of the simple form of the R -matrix on pairs containing the highest weight 
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vector (Proposition 18.5) we choose the highest weight vector v A G W A : 

6 x v x :=(idA®n^)o(x A>A ®id A .)(^ va®v*®v*J 

^ k ' 

=(id A g> n* x ) ( x a,a(va ® v fe ) ® J 
^ k ' 

=(idA®n A )f^( ? A v jfc (8)v A )(8)vM, by (61) 

= ^ vV P v A )^v, = e (2p ' X) v k (v x )q\ k 
k k 

=e (2 P ,X) q x(j2 v k (v x )v k ^j = e^ X) q x v x = e^e^ x \ x = e^^W 
The case of 9^ 1 is analogous so 



(63) 9f = e ±(2 " +A ^id A . 
Taking quantum traces on both sides yields. 

(64) p± = e ±(2 ^ X) Tr 9 (id A ) = e^ 2p+X ' X) d x . 

Exercise 18.7 We have seen that m is just multiplication by a scalar. This scalar is 
called the framing anomaly. Show that the framing anomaly is € N ^ l / N _ 

18.2 S-Matrix and Chern-Simons partition function for S 3 

In this subsection we compute the entries S A „ of the structure matrix from Axiom 17 and 
we compute the partition function of S 3 . Following Turaev and Wenzl [153] to compute 
s Xfl we introduce the meridian morphism T X)l : W\ — ► W A , see Figure 18.2. One can 




Figure 18.2: The meridian morphism 
see by inspection that s Xfl = Tr q (F x ^). Again, T XlM commutes with any morphism 
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since we can slide the latter up and down the A strand, and acts as a scalar since 
it is an endomorphism of an irreducible representation. It is therefore sufficient to 
compute it on a highest weight vector v\. Let {«,■} and {«'} be dual bases for V M and 
V* . Reading the ribbon expression off Figure 18.2 we have 

va h-> ^ va (8> m ®u 1 ^ '^^(q x ui) (g> va <8> u l 

i i 

i ^ ^ v\(q 2X Ui + lower weight) <g> u' : i-> ^ v \uXq 2(p+X] '«,-) 

f i 

= (TrM^ 2(p+A) ))vA = X^(^ 2(p+A) )vA. 

Here we abuse notation denoting by /i the irreducible representation of st^C indexed 
by the dominant weight fi so that Tr //(•) is the ordinary trace in this representation. 
Taking the quantum trace of this Ta« morphism implies that 

(65) S Am = TVA(r A/ J) = Xa(? 2 ")X / ,(? 2(p+A) ) = X/,(-? 2(p+A) )^a. 

In particular, Sqo = 1 . 

Since S 3 can be obtained from itself by a surgery on the empty link 0, we compute 

Z CS (S 3 ) := r sl * c (S 3 ) := (p-)^)p-^)-^)-iF(0) = P" 1 , 

where recall P 2 := X^ag/^a anc ^ = Sag/ ^a ^a- Note that there are many other 
links that produce S 3 and using expressions for them produces universal algebraic 
relations among the characteristic numbers of any modular category. Since we already 
computed d\ in (60) it appears that we just have to attach a sum and a square root to 
the formula. The problem is that we are ultimately interested in the Chern-Simons free 
energy and this is the logarithm of the partition function. Hence a multiplicative, not an 
additive, expression is desirable. This can be treated of course as a problem in special 
functions theory, but we prefer a more conceptual approach based on an understanding 
of the geometry of the Weyl alcove. The trick is to represent the Weyl alcove as a 
fundamental domain of a group action; see Kirillov [86], Kac [82], Samelson [135] and 
Sawin [136]. In addition this will allow us to finally verify the non-degeneracy of the 
S-matrix. 

Definition 18.8 Given an integer / G Z define the affine Weyl group W of st^C as 
the group of isometries of its Cartan subalgebra f) generated by reflections about the 
hyperplanes (x, a,) = kl' for every integer k £ Z and every simple root a,- (as before 
/' is / for / odd and 1/2 for / even). We also define the translated action of W as 
w ■ x := w(x + p) — p, where p is the Weyl weight. 
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Remark 18.9 Notice that we use both actions of elements of the Weyl group. In the 
Weyl character formula the usual action given by reflections perpendicular to simple 
roots is used. In the proof of Theorem 18.13 we use the translated action. We will 
always use the notation cr(A) for the usual action and a • (A) for the translated action. 

Setting k = in the definition we get the reflections Sj that generate the (ordinary) 
Weyl group W C W l . Another type of elements of W is obtained by performing the 
reflection about (x, a,) = followed by the reflection about 



This is easily seen to be the translation by I'cti and we identified the subgroup of 
translations l'A r C W . Every other reflection can be performed by translating the 
corresponding hyperplane to the origin, reflecting and then translating back. Thus we 
arrive at the presentation 



where in the semi-direct product W acts on l'A r by conjugation. In particular, l'A r is a 
normal subgroup of W l . 

Recall that a closed set D is a fundamental domain of a continuous group action if every 
orbit intersects it and no orbit intersects its interior more than once. The subgroup of 
translations comprises integer multiples of I' a for a G A + and therefore one of its 
fundamental domains is the polygon 



Of course, any translate of TL is also a fundamental domain of l'A r . Note that by 
definition the translated action of l'A r is the same as the usual one while TL — p is 
invariant under the translated action of W. The fundamental domain for the ordinary 
action of W is well known to be the Weyl chamber (see Humphreys [79] or Fulton and 
Harris [62]) 

A+ := {x £ f) | (x, a) > 0, for all a £ A + } 
and hence A + — p serves as a fundamental domain for the translated action. 

Exercise 18.10 Let G = A x B with B being a normal subgroup. Suppose Da, Db 
are fundamental domains of A and B respectively, and D# is invariant under A. Prove 
that D := Da H Db is a fundamental domain of G. 

Intersecting TL — p with A + — p (see Figure 18.3) we establish 




(66) 



W l = W k I'A 



H := {x e f) | (x, ±a) < l', for all a E A+}. 
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Figure 18.3: Affine Weyl group and C 



Corollary 18.11 The closed Weyl alcove 

c' = H n A + - p = {x £ f) I < (x + p, a) < I', for all a G A+}, 
is a fundamental domain for the translated action of the affine Weyl group W 1 on fj . 

After these geometric preliminaries let us move on to a simplification of the quantum 
diameter V 2 := J2\ei^\ - ^ rst note tnat 

hx = Sao = Tr 9 (x 0jA o x A>0 ) = Tr^(id A ) = d x . 
This suggests considering the square of the S-matrix, indeed 
(67) (5 2 ) 00 = ~ s ^~ s ^ = Y, d » = v2 - 

Non-degeneracy of the 5-matrix was originally established by Turaev and Wenzl in 
a roundabout way based on results of Kac and Petersen from the theory of affine Lie 
algebras [153]. The direct proof based on an explicit computation of s 1 that we present 
here is due to A Kirillov [21, 86]. As a bonus, it gives a nice formula for the quantum 
diameter. This proof requires the orthogonality relation for characters of finite groups 
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sketched in the next exercise. Another, purely algebraic proof is given by M Miiger 
[112]. 

Exercise 18.12 Define (4>, ip) := SgeG fiis^is)* f° r functions defined on an 
arbitrary finite group. Prove that (x\, Xn) = ^A/t- This is similar to Exercise D.36 from 
Appendix D. 

Theorem 18.13 Let I be an even integer. Then (s 2 )\^ = V 2 5\ ll * and 
(68) p2 = ( _ ir M^M > 

where wq is the order-reversing permutation and 5q is the Weyl denominator (92). In 
particular, V / and the s-matrix is non-degenerate. 



Proof We first transform expression (65) for s\ v using the Weyl character formula 
(93) 

~sxu = Xu(q 2{p+X) )dx = Xu(q 2ip+X) )Xx(q 2p ) 

6 u (q 2(p+X) )Sx(q 2p ) = 5 u (q 2(p+X) )Sx(q 2p ) 
(69) ~ 5 (q 2 (P+») 5 Q (q 2 P) ~ ' y 

1 V (_D<r f2lcr(X+p),v+p) 

So(q 2 ") ^ 

Notice that while s\ u is originally only defined for weights A , u in the Weyl alcove A l w , 
the right side of equation (69) makes sense for all weights. We extend the definition of 
Sai/ to all weights by equation (69). We can now replace the summation over K l w in 
(67) by summation over a more convenient set, the quotient group A w /l'A r . This is 
done in two steps. First, we know from Theorem 17.35 that the Weyl modules with 
the highest weights in C \C l have quantum dimensions and Exercise 17.44 implies 
~s\v = Tr q (x x ,,s o x^a) = for v G c'\C' so 

(s 2 )\p = V" S\uS. 



/ ; J M*VH — / J 



where a',, := A w . n c'. For the second step notice that the value of s\ v is only changed 
by the sign of the orientation of the transformation when A or v are acted on by elements 
of the affine Weyl group via the translated action. Indeed applying a translation /'a,- 
from the affine Weyl group to v does not change the value of s\ v since 

£ 2( CT (A+p), v+l'cci+p) = e 2(a(\+p),u+p) e 2(<T(\+p),l'ai) = ^,{a(X+p), u+p) 
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and oti pairs with any weight to give an integer and e 21 ' = 1 . Applying the translated 
action of an element of the Weyl group to A gives 

f , _ 1 V (A\<? f 2(a(T-\+p),v+ P ) 

\ ,2(<j(r(A+p)),f+p) 

n 2p\ K l) 



(70) 

£ (-D CT e 2(ff 



<W P ) 

( — 1) T / ,xrr 2(a(\+p),y+p) 



5o(q 2p ) a ... , 

By Corollary 18.11 and (70) summation over can be replaced by summation over 
equivalence classes in A w /W l and furthermore in view of (66) 

(71) ^ ^ S\vSv/j, = ^ ^ S\pSup = T7J77 ^ ] ^Xu^u/j,- 

ueA' tr ueK/w' ueA w /i'A r 

This gives 

(72) (3?V= 1 V V ( _i r -2 e 2( CTl (A + p) +CT2 (M+p),,+p). 

To simplify the last double sum we change the order of summation and look closer at 
the maps e 2< - K, '\ where k G A w . Since e 2/ ' = e' = 1 they are well-defined as maps 
A w /l'A r — > C* and in fact are characters of the Abelian group A w /l'A r . Since e is a 
primitive /th root of unity e 2(K ' ' ^ defines the trivial character if and only if (k, v) is 
divisible by 1/2 for any v G A w . Since / is even and I' = 1/2 this is equivalent to 
k G l'A r . The orthogonality relation for characters now yields 

ueKp'Ar ue\ w /i'A r [\A w /l'A r \, k e l'A r . 

This implies that the terms in the double sum (72) are unless a\(X + p) + (72(/i + p) G 
/'A r . But by Theorem 17.35: p + p = -w (p* + p*) = -wo(p* + p) since p* = p. 
Therefore, the following are equivalent. 

ai(X + p) + a 2 (p + p) G l'A r 

(Ji(A + p) - a 2 w (p* + p) G /'A r 

A + p G cxf 1 o- 2 w (/i* + p) + f A, =: ct(p* + p) + l'A r 

A G cr(p* + p) - p + f A r = a ■ p* + /'A r 

A = vv • p* , for w G 
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But A, /x* G A w C C 1 are within a fundamental domain of the translated action and the 
last condition can only be satisfied when A = /i* and w = 1 or equivalently o\ = 02 wo . 



In particular, (s 2 )\n = for A 7^ fi* . When A = [i* expression (72) reduces to 



1 r (— 1)**° I W| I A^/i'Ar I = (-1) W0 \ Aw ' lAr 



The formula for the quantum diameter now follows directly from (67). □ 

Using the formula for the Weyl denominator, we obtain the following formula for the 
quantum diameter that generalizes to any semisimple Lie algebra. 

V = i w °\A w /l'A r \ 1 ' 2 - e- (p ' a) y l . 

agA+ 

We can make this formula more explicit for stjyC. Recall that the level is k := I' — N . 
By (94) and Exercise 18.3 

7Ti(p,ajj) 7Ti(p,OLjj) 

5o(q ip ) = n - e-(p,Q) ) = n 2i - — 27 — 

a6A+ i<j 

t N(N—i) -| r . /ttO'-OA mN-p ^pj . / irj \ N ~ J 
= 1 2 2 sin = 1 2 2 sin 

11 I ^ + ^ J 11 \k + NJ 

i<j v ^ / 7=1 

Exercise 18.14 Prove that |A w /(jfc + A^)A r | = N(k + N) N ~ l , and show that (-l) w ° = 
(_ 1 )iV(JV-l)/2_ 

We arrive at the following. 

Corollary 18.15 The Chern-Simons partition function for the three-sphere is 
(73) Z cs (S 3 ) = V- 1 =N- 1 /\k + N)V-W 2 J] (2^^))^. 

Remark 18.16 When I is odd the level k = l/2 — Nis only a half-integer and Witten's 
heuristic argument for the invariance of Z cs breaks down. Thus, there is no 'physical' 
reason to expect that L^ es (s[;vC) produces topological invariants for e a primitive odd 
root of unity. The above proof of non-degeneracy of the S -matrix also fails for / odd. 
Indeed, it is sufficient that k G I A r as opposed to k G lA r = I' A,- for e 2(K ' ' * to define 
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the trivial character and we can not reduce the double sum completely. As a matter 
of fact, the S-matrix can be degenerate in this case. This is due to appearance of 
nontrivial transparent [35] (also called degenerate [136] or central [112]) simple objects 
in Tift e (s[/vC). These are the Weyl modules Y\)% such that R\ tT = R~ l x for all alcove 
weights A. Obviously, the trivial object is transparent in any ribbon category. The name 
comes from the fact that x x T = x t,a an d any strand can be pushed through r-colored 
ones without changing the invariant. As a result, transparent objects create a row s T \ 
in the S-matrix that is proportional to the row of the trivial object S A making the 
matrix degenerate. When / is odd and N is even the fundamental weight ujn indexes 

2 

a nontrivial transparent object in TiZi e (sl;vC). In particular, for sfeC reduced tilting 
modules at odd roots do not form a modular category (see Sawin [136]). Moreover, 
under mild technical assumptions one can prove that a category satisfying Axioms 1-16 
of (16.5) (such categories are called premodular) with V ^ is modular if and only if 
it does not have nontrivial transparent simple objects (see Bruguieres [35] and Miiger 
[1 12]). Surprisingly, Tift e (s[^C) can be 'modularized' by a quotient construction and 
made to produce nontrivial invariants even when / is odd except when N = 2 mod 4 
[136]. At present, a geometric or physical explanation for this phenomenon is lacking. 

We conclude with some remarks about computing the Reshetikhin-Turaev invariants for 
more general 3 -manifolds. There are two different ways that may be used to organize 
many such computations. The first method is via the shadow invariants of V Turaev 
[152]. The second method is to expand the theory into a full topological quantum field 
theory. Most computations in the physical literature use the topological quantum field 
theory viewpoint. 

Recall the twist matrix ~t\^ := e (2p+x,x ^5\^ from (63). For many 3-manifolds such 
as circle bundles over Riemann surfaces one can avoid using the R -matrices directly 
by utilizing the fact that renormalized s, t matrices form a projective representation 
of the modular group SL2Z that appears in many surgeries. Namely, set s := V~ l s 
and t := Q~ l ~t with £ := (p + /p~Y then s, t can be taken as the images of the standard 
generators S, T of SL2Z 




Exercise 18.17 Show that s, t satisfy the standard relations of SL2Z: (st) 3 = s 2 , s 2 t = 
ts 2 , s 4 = 1 , see Bakalov and Kirillov [21, 86]. 
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In terms of the s-matrix we have 

a -n s ° x 
d\ = Vs QX = — 

Z CS (S 3 ) = V- 1 = sou 

These identities are frequently used in practical computations. 

>We combine equations (69) and (68) in this subsection into the following corollary. 

Corollary 18.18 In any semisimple Lie algebra we have the following explicit formula 
for the s-matrix and t-matrix. 



^ = rl A +l|A w //'AV|-V2£ ( _i)V«" 



(A+p),v+p» 



Here is the coroot lattice and ((-, •)) is the Killing form normalized so that ((a, a}) = 2 
for short roots. 

Specializing to sI^C gives 

SXu =N- l / 2 (k + N)V-W\i)- NlN - l)/2 E (-\re 2{p+1/Mp+X)) . 

Remark 18.19 The sign in the exponent (j) - ^^ -1 )/ 2 differs from that of [21] since 
we are using the right-handed Hopf link to define the s-matrix (not the left-handed 
one.) The sign used here and the use of the right-handed Hopf link are correct. 

In the next section we compute the free energy of the 3 -sphere and perform the final 
comparison between the Gromov-Witten free energy and Chern-Simons free energies. 
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Part III 

Comparisons and recent developments 

19 Comparison of the free energies 



In this section we reach the final goal of this paper by comparing the Gromov-Witten 
and Chern-Simons free energies for and S 3 . We start by recalling some necessary 
facts and formulas from the theory of special functions that are used in transforming 
the expressions derived earlier in the paper. We apply these formulas to reduce both 
energies to a similar form and perform the comparison. Even though the energies do not 
match exactly the difference does not contain any positive powers of x := 2ir/(k + N) 
(called the string coupling constant and denoted g s in the physical literature) which 
means that we have an exact match for the counting invariants. 

19.1 Bernoulli numbers and special functions 

After generating functions for the invariants are computed the claim of the Gopakumar- 
Vafa Large N Duality reduces to a problem in the theory of special functions. Aside 
from Bernoulli numbers we need the Eisenstein functions, the Riemann zeta-function, 
polylogarithms and the Barnes function. Rather than simply referring the reader to 
various sources for necessary formulas we briefly review the definitions and relationship 
between them in this subsection. 

Since free energies are essentially the natural logarithms of partition functions it helps 
to present all terms in partition functions as products. In particular, the Chern-Simons 
partition function contains sines and we start by deriving the Euler product formula 
for the sine function. There are different ways to do this but we choose the one using 
Eisenstein functions since we need them later, see Weil [157]. 

Definition 19.1 The Eisenstein functions are defined by 



oo 



E k (z) :=v. V .J2 (z + ny k 



n=—oo 



where we use the Eisenstein convention: 



oo 



N 




n=— oo 



n=-N 
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The main properties of the Eisenstein functions are collected in the following exercises: 



Exercise 19.2 Show that E\ (z) — ir cot(7rz) is a bounded entire function that evaluates 
1 

2 



to zero at z = \ and use Liouville's theorem to conclude that it is identically zero. 



Exercise 19.3 From the definition, it follows that E' k (z) = — &Ejt+.i(z), so all of these 
functions may be found by differentiating the cotangent function. Using this show that 
Ez(z) = 7r 2 csc 2 (7rz) and £ 3 (z) = E\{z)E 2 {z). 

Exercise 19.4 Set s(z) = z \[™ =l (1 - and prove that s'(z)/s(z) = Ei{z). 

Exercise 19.5 Express ^(j(z) 2 £a(z)) using just s{z), £f(z) and £2(2)- Conclude that 
it is zero. 

It is not hard to see that s(0) = and s'(0) = 1 . It follows that 

hm = 1 . 

z-^o sin ttz 

However we know from Exercise 19.5 that is constant. Thus 

00 7 
z 



n 2 



sin(7rz) = ttz Y\ y 1 

n=\ 

To transform multiple sums that appear in the Chern-Simons free energy we need closed 
formulas for power sums of natural numbers. These formulas involve the Bernoulli 
numbers. 

Definition 19.6 The Bernoulli numbers Bu are defined by their generating function: 

zl 



x k 



k=0 



Again we collect the main properties we need in an exercise, see Andrews-Askey-Roy 
[11]. 

Exercise 19.7 Show that I coth(|) = f + -Jiz\ an d conclude that B\ = — \ and the 
rest of the odd Bernoulli numbers are zero. Also compute Bo, B2, B4 and B&. 

Exercise 19.8 Expand c - as a sum of exponentials, and then expand the 

exponentials into power series to obtain + Y1T=\ Y^!j=\ Fjj. ■ 
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Exercise 19.9 Expand each factor of 1 in a power series and multiply the two 
resulting power series to obtain 



EE " )»> 



J J J n-k+ln\ 

n=Q j=Q v 7 

Exercise 19.10 Compare the expansions from the two previous exercises to obtain a 
formula for the sum of powers of the first N — I natural numbers. Use the fact that all 
of the odd Bernoulli numbers other than B\ vanish together with the definition of the 
binomial coefficients to conclude 

U 2 &V U J 2p-2k + 2> 

j=i ~~ 4=0 



2k J 2p+\ 



Bernoulli numbers are closely related to values of the celebrated Riemann zeta function 
at even integers. 

Definition 19.11 The Riemann zeta function is defined by (Andrews-Askey-Roy 
[11]): 

1 f°° u z ~ l ^ 



n~ z . 



n=\ 

where T{z) is the usual gamma function of Euler 

/>oo 

r( Z ):= / e -'t z - l dt. 

Jo 

Exercise 19.12 Use integration by parts to derive the usual relation between the gamma 
function and the factorial. Expand (e u — 1) _1 in powers of e~ u and substitute into 
the definition of the zeta function to obtain the formula Q(z) = Y^=\ n Z - Use the 
generating function definition of the Bernoulli numbers in the definition of the zeta 
function to obtain for n > 1 , 

(74) a2 n) = (-ir +l( ^^. 

2(2n)\ 

The gamma function can be meromorphically extended to the entire complex plane via 
T(z + 1) = zT(z) and the zeta function can be meromorphically extended to the entire 
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complex plane via the functional equation discovered for the zeta function by Riemann 
himself (see Hardy and Wright [72]): 

C(l - k) = 2(2tt)-* cos(7rfc/2)r(jfc)C(£) . 

The Chern-Simons partition function contains a factor that can be identified with the 
volume of SU(iV). This volume can be expressed in terms of the Barnes function. 

Definition 19.13 The Barnes function is defined by 

oo 

G 2 (z + 1) = (2 ^/ V(*+ 1)+ ^ 2 )/ 2 J] (d + z/kfe<*-*l*) , 

k=i 

where 7 is the Euler constant given by 7 := lim„^o {J2'l=i — mn ) • 

Remark 19.14 For integers greater than 1 one can show that CfaiN) := nj=i 2 j' ■ 

The following asymptotic expansion for G2 in terms of Bernoulli numbers is given by 
Adamchik [2] : 

(75) ln{G 2 {N + 1)) = 

00 g 

\N 2 InN -\N 2 -^ InN - N('(0) + C'(-l) + £ ^^—^^ 

Remark 19.15 The formula in [2] has a negative sign in front of the sum. Careful 
inspection shows that a sign was lost when the expression following line (29) in this 
paper was substituted into equation (20) from this paper. 

Finally, manipulations with the Gromov-Witten free energy require the use of polyloga- 
rithms. 

Definition 19.16 The poly logarithm functions are defined by (see eg [42]) 

00 

(76) Li p (z):=^rt". 

n=\ 

One can see by inspection that Lii(z) = — ln(l — z)and z^Li p+ i(z) = Li p (z). Therefore, 
polylogarithms indexed by positive integers have a logarithmic branch point at z = 1 - 
On the other hand, polylogarithms with negative integers are meromorphic in the 
complex plane and relate to the Eisenstein functions by a change of variables and 
renormalization. 
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Exercise 19.17 Expand 7rcot(7rz) as a power series in the exponential e~ 2n,z (see 
[1 1]) and differentiate the expression obtained by combining this with Exercise 19.2 
and the definition of the polylogarithm to obtain for q > 1 

n=—co 

We will need power series expansions for Li3_2g(e~0 at 0. Note also that these 
functions are manifestly periodic with the period 2m. For g > 2 combining the last 
exercise with the negative power binomial theorem gives 

,_ 2 „ (2g-3)\ x 



Ii 3 _ 2jr (a-') = (2g-3y.t 2 - 2 z+ "* >: 2 (l+(?/2vrm)) 2 - 2 V- 2 « 

ft = — OO 

= (2g-3) It 2 - 2 * 

(77) + (2,-3)! ± ± ( 1S+ l') { -t)\2,i) 2 - 2 ^n 2 - 2 ^ 

„^o h=0 ^ ' 
n=— oo 

= (2£-3)!f 2 ^ 
+ (2g-3)! £ 2^+ /j - 3 Vo ,! (2vr0 2 - 2 ^ ,1 C(2,+/ J -2) . 

A even 
/j>0 

As mentioned above the functions Lii(e -f ) and Li3(e~') have a branch point at f = 
and can not be expanded into a Taylor series. However, this is easy to fix by adding 
logarithmic 'counterterms' that render them holomorphic in a neighborhood of 0. For 
instance, Lii(e~') = — ln(l — e~ ! ) behaves like — In t near so the sum Lii(e~0 + In t 
is holomorphic. Of course, by writing In t we are implicitly fixing a branch of the 
logarithm and thus the polylogarithm as well. The corresponding expansions are derived 
in the next lemma. 

Lemma 19.18 The polylogarithms admit the following series expansions: 

(78) Li 1 (e- t ) + ]nt = t/2+ V -(2<r m C(ra)(/f) m , 

m 

m even 
m>2 

(79) Lhie-') + - In* = C(3) - ((2)t + 3t 2 /4 + P/12 

2 



E - ( TV ^(2vr) 2 - m C(m-2)(/0 m . 

m(m — l)(m — 2) 



m even 
m>4 
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Proof The function f(t) = ln(f) + Li \{e , ) = \n(t(\—e ') 1 ) is clearly holomorphic 
at zero. Now compute 



1 oo 

-E 



oo 

e 2« 2n-l 



2 ^ (2n)! 

Since /(0) = we can integrate and use formula (74) relating the Bernoulli numbers to 
the zeta function see that 

JKJ 2 Z^(2n)!(2n) 

n=l 

z ^— ' n 

For the second equality set g(t) = Li3(e~') + jt 2 \nt — \t 2 . One easily computes 
8(0) = C(3), 

g'(f) = -Li 2 (e" f ) + ?lnf-f, 

giving g'(0) = -C(2) and 

*"(*) = LiKe-') + In? = \t + V -(27r)- m C(m)aO m - 

m even 
m>2 

Integrating twice as above with the computed constants of integration, reindexing the 
sum with h = m + 2 and using the definition of g(t) gives 

Li 3 (e- ? ) + \t 2 In f = CO) - C(2)? + ^ 2 + i^ 3 

- V (2ir) 2 - h ((h - 2)(if) h . 

h even 

/»>4 

This completes the proof. □ 



19.2 The Chern-Simons free energy 

Recall from equation (73) that the SL^AO-Witten-Reshetikhin-Turaev invariant of 5 3 
is given by 

Z(S 3 ) = rf~ c (5 3 ) = N-^ik + Nf-^ 2 ]! 

7=1 



2 sin 

\k + NJ 
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Recall that the Chern-Simons free energy is given by 

F M = ln(Z u W(M)/Z U(/V) (M)). 

The formal mathematical definition of the U(N) partition function has not yet been 
agreed upon. In addition there is no mathematical definition of Zq^ n \M). To check 
Large N Duality we define the unnormalized Chern-Simons free energy to be the 
logarithm of the SU(AO-Witten-Reshetikhin-Turaev invariant. 

Definition 19.19 The unnormalized Chern-Simons free energy is 

F<=? = ln(rf nC (M)) . 



It follows immediately that 

1 — N N ~ l 
=— r - \n{k + N)-\\nN+ - j) In 

(80) =— — \n{k + N)-\\nN+^ N -^\T^j 

N—\ / oo .2 



2 sin 



k + N 



7=1 \ != i 

Here we have used the product expansion of the sine function derived just below 
Exercises 19.2-19.5. We will now concentrate on the last sum in this expression. We 
call it the perturbative Chern-Simons free energy, and denote it by F pert . Using the 
coupling constant x = lit jik + N) , replace k + N in F^^ , expand the logarithm in a 
series (ln(l — z) = — z — \z 2 — 5Z 3 — ■ ■ ■ ) and use ((s) = YlnLi n ~ s to obtain 

N— 1 / oo 2 '2 



/v — i / oo 2 *2 \ 

^=e^(e>-(i-s?) 

7=1 \i=l 7 

A'" 1 / oo oo op 2p v N-l , oo op 2p \ 

= -£c*-/> (£ E ) = -£c*-/> ( £ 4^« 2 ^ ) ■ 

7=1 Vi P =i ' 7 7=1 >=i ' 7 

Recall from Section 15 on perturbative Chern-Simons theory that we expect based 
on intuition from the path integral that the free energy should have an interesting 't 
Hooft expression of the form F = Ylh^^^^Fgfi- This motivates our next 
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manipulations. Applying the two formulas derived in Exercise 19.10 gives 

oo p 



CO . 

P =i p 

y^\2k )p{2p+\) y 



p=\ k=0 
co . 

+Y.Y {l7ir2pa2p)N2p+lx2p 

P =i p 

co p 



=EE(^2t ') P «^^X + ^ rW-^. 

p=l k=0 

We now re-index the double sum setting g = k and h = 2p — 2k + 2 . Notice that 
2p + 1\ 2Jfe - 1 / 2p - 1 \ 1 



2£ / p(2p + l)(2p -2k + 2) \2p-2k + 2J k(2k - 2) 

'2g + h-3\ 1 



for g > 1 . We conclude that 



■h2g-2+h 



hh-2 



^ rt =EE ( 2g+ r 3 )^^) (27r)2 ~ 2 " ftc(2 ^ 2+wv 

g— 2 /i even 

+ E ^r'ccwv- £ «sr^r2)0') 2 -"«'-^ 

h even A even 

/z>2 h>4 

The second sum in the above expression is the g = 1 term and the third sum is the 
g = term. In the original paper of Gopakumar and Vafa the last two sums were 
rewritten using polylogarithm identities (79) and (78) (see Gopakumar and Vafa [65, 
(3.7), (3.10)]). We prefer to keep them as they are and simplify the polylogarithms on 
the Gromov-Witten side instead. 

The term 

E<»-;>4^ -^>-+E<»-;w. 
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from the Chern-Simons free energy is called the Barnes term [65]. This is because one 
has 

N- 1 ,N- 1 v 

^(N -J) lnj = lnf J = ln(G 2 (/V + 1)). 

7=1 V ;=i J 

We use the asymptotic expansion (75) for the Barnes function to analyze this term in 
the final comparison. 



19.3 The Gromov-Witten free energy 



Recall from Section 12 that the restricted Gromov-Witten free energy is 



(82) ^J = EE^ 2s " v 



td 

8=0d=1 ■ dy V 2 _ fd ~ 1 a ,rfy 

<2=1 d=l 



where ? is assumed to be in the right half -plane for the series to converge. 



-td 



Remark 19.20 The formula derived in Exercise 19.12 allows one to compute the signs 
of the Bernoulli numbers. Using these signs one can check that the following formulas 
that we give for the Gromov-Witten invariants agree with the formulas from Faber and 
Pandharipande [55]. 
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We can rewrite equation (82) using the polylogarithms (19.16) and Exercise 19.7, 



,fe" 3 *t- (2s)! j 

oo 

(84) = ((-1)^" 1 (2^ - l)S 2 ,Li 3 _ 2 ,(^ f )/(2^)!) A 

g=0 

Comparing (83) with the definition of the Gromov-Witten free energy (82) we see that 

B 2g 



(85) =E^V 



N g4 {X s ,) = d 2 ^\-\f-\2g-\) 



<2g)\ 



The last line in the above equation serves to define the genus g contribution to the 
restricted Gromov-Witten energy. In general the expansion of the free energy can be 
expressed as a sum of polylogarithms. The result is described in the following exercise. 



Exercise 19.21 Expand the p = term in the following expression as we did above, 
then express the sines in the remaining terms using exponentials and simplify with the 
binomial formula 

oo oo 1 , , x 2p-2 

p=0 (3 d=l ^ ' 
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The answer you should get is 



+g xfo-y ct 2 ) (i -^;;;r s - i) )u3-, 8 ( e -^))^-'. 

Combining the polylogarithm formula for the Gromov-Witten free energy (84) with the 
power series expansion of Li3_2 g for g > 2 (77) gives 

(86) ^/ = —^—{it) 2 ~^ 
8 2g(2g-2) K 



h even 
h>0 

We next need to consider the g = 1 and g = terms. The polylogarithm formula 
together with Lemma 19.18 gives 

(87) Ff s3 = t/24 - i Inf + £ ^(27r)- k C(W)* • 



and 



/i even 
h>2 



t 2 

(88) F^ 3 = CO) - C(2)t + 3t 2 /4 + f 3 /12 - - In t 

-Y, h(h _ i Kh _ 2) (^) 2 -*c(ft - 2Xio*. 

/i even 

Note that in the definition of the Gromov-Witten free energy (82) the sum over degrees 
begins with d = and we have not included d = . The degree are constant maps 
and there is a question as to whether they should be included. The answer turns out to 
be 'yes'. Indeed, notice that the second term in (86) is almost completely identical to 
the first term in (81). The only difference is that summation there starts at h = 2 as 
opposed to h = 0. The extra h = term reads 



B 2g 



-(27T) 2 - 2 ^(2g - 2). 



8(2g - 2) 

Using the result from Exercise 19.12 together with the fact that XCX53) = 2 we compute 

<2,r\(2 8 - 2) - ( - 1)g(2 * - 1)S2 ^- 2 xiX ^ 



g(2g-2y 2{2g-2){2g)\ 
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But for g > 2 this is exactly the negative of the degree zero invariants N gt o(X) computed 
in equation (19)! Genus and 1 contributions line up as well. There are no contracted 
genus zero or one stable curves fixed by the torus action so Afyo = ^Vi,o = 0. Combining 
the formulas we get the degree zero term 

(89) Ng o = - n B2g -, (27r) 2 - 2 gC(2g - 2) 

g(2g ~ 2) 

that exactly cancels the extra h = term. In other words, a major discrepancy between 
the Gromov-Witten and the Chern-Simons free energies takes care of itself if we 
include the degree zero terms (as we should have from the beginning). Also note that 
the sum over genus would diverge without the cancelation afforded by the degree zero 
term. This is why the definition of the Gromov-Witten free energy includes the constant 
terms. 



As we explained F\ = F\, Fq = Fq and (86) turns into 

h even 
h>2 

We are all set for the final comparison of the free energies on both sides of the duality. 



19.4 The final comparison 



As we warned in the introduction the match between the Gromov-Witten and the 
Chern-Simons free energies will not be exact. The discrepancy may be due to the fact 
that as physicists insist, we should really consider the U(/V) not SU(/V) Chern-Simons 
theory which is expected to insert some additional normalizing factors into the partition 
function (see Marino [104, 106]). Combining equations (80), (81)) and the asymptotic 
expansion for the Barnes term (75) gives the following expression for the unnormalized 
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Chern-Simons free energy, 

F^ 3 s (N,x) = \N(N - 1) In* + i(l - AO ln(k + N) + \N 2 \nN 
-\\nN- \N 2 - i InJV - C'(0)iV + C'(-l) 

- V - (2tt) 2 - /, C(/j - 2)N h x h - 2 

^ h(h-l)(h-2y ' s 

h even 
h>4 

/i even 
h>2 

+E E f^^marw^" 

g— 2 ft even 
oo _ 

^ 2^(2^ - 2) 

In the same way we combined terms to get the Chern-Simons free energy, Definition 12. 1 
and equations (88), (87) and (90) give the following expression for the full Gromov- 
Witten free energy of the resolved conifold, 

Fxgfry) =^-12^ + C(3)?- 2 - C(2)ty- 2 + 3t 2 y- 2 /4 
+ t i y- 2 /\2- \t 2 y- 2 \nt 

h even 
h>4 

+ E ^r h amtf 

h even 
h>2 

+ E^2) E ( 2g+ ^ 3 )(2 7 r) 2 - 2 -"C(2^-2)(^)V- 2 

g— 2 /i even 

A>2 



Z- 2 g(2g-2) W 



g=2 

Note that some of the extra terms that appeared 'on the Chern-Simons side' in the 
original paper [65] show up 'on the Gromov-Witten side' with opposite signs in our 
presentation. This is because we chose not to reexpress the genus and 1 contributions 
in the Chern-Simons free energy via the polylogarithm identities. By inspection, 
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under the substitution it = ±Nx and y = x all the infinite sums match exactly. In 
light of the complicated definitions and expressions for the free energies this is a 
remarkable coincidence. Notice that the sums represent exactly the perturbative part 
of the Chern-Simons free energy and thus contain all of the information about the 
perturbative invariants. 

Analytically, we are comparing series expansions of two functions near the origin (f , y) = 
(0, 0). It may seem odd that we should choose the origin since (t,y) = (fr^r, jq^) 
converges to (2m, 0) at large N. However, as one can see for example from (82) the 
free energy is periodic in t with period 2tti so the coefficients are the same as the 
coefficients at the origin. Another issue is that originally in (82) we assumed the real 
part of t to be positive. The problem with analytically continuing the free energies to a 
punctured neighborhood of the origin is that the logarithmic terms in both expressions 
are ambiguous. However, for us the free energies are just a bookkeeping device for the 
invariants on both sides of the duality. Since logarithmic and other mismatching terms 
outside the infinite sums carry no apparent geometric information they do not pose a 
serious problem. Finally, notice that the infinite sums are real- valued which allows us 
to package the comparison into the following nice form. 

Theorem 19.22 The full Gromov-Witten free energy and the unnormalized Chem- 
Simons free energy are related by 

Re(F$J(iNx,x) - F^(N,x)) = ^ lnx + C(3)x~ 2 - \ ln(2vr) - C'(-l) . 

Proof Combining the expressions for the free energies gives, 

Re(F™(iNx,x)) -F^ 3 s (N,x) = 
\(N-l) ln(k+N) - ±N(N- 1 ) In x- \N 2 In N+3N 2 /4+ ^ In N+('(0)N- C'(- 1 ) 

-Re ( i ln(aAO) +C(3>T 2 - \ N 2 + \N 2 Re(ln(ixN)) . 

Using the fact that x = to write ln(& + N) and the value C'(0) = —\ ln(27r) from 
Sondow [147] allows one to combine like terms further to obtain the result. □ 

We conclude our presentation of the Gopakumar-Vafa duality with this remarkable 
equality. 

Remark 19.23 Large ,/V duality is said to be exact when the full free energies are equal. 
It is said to hold to a leading order when the genus zero terms agree. Given this the term 
((3)x~ 2 in the comparison theorem is slightly disturbing. This term prevents the duality 
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from holding at the level of the genus zero contributions. In the physical theory this 
term is canceled by additional genus zero corrections in degree zero. Ooguri and Vafa 
[122] obtained a perfect agreement of the two sides using the physical normalization of 
the S Chern-Simons free energy (which can only be computed on a case by case basis 
comparing exact answers to perturbative expansions). Thus, we expect that one will 
obtain an exact correspondence after enough examples have been computed to find a 
general form of the correct normalization. 

20 New results (2003-2006) 

In this section we describe some recent directions of research inspired by the Large 
N Duality and discuss some difficulties and open problems encountered within them. 
Obviously this account is biased by our background and interests, and we apologize in 
advance for any inaccuracies and/or omissions. As in the history Section 1 the dates 
in the text refer to arxiv submissions while references are given wherever possible to 
journal publications. 

20.1 Computations of the Gromov-Witten invariants 

Computational verification of the Gopakumar-Vafa Large N Duality depends largely 
on one's ability to compute the Gromov-Witten invariants for as large a class of 
threefolds as possible. Toric threefolds seem to be natural candidates to start with since 
holomorphic torus actions are a part of their definition and the full power of virtual 
localization can be applied. However, as the sample computations in Section 7.3 show, 
the complexity of expressions obtained through virtual localization often grows very 
rapidly with degree and genus and quickly becomes unmanageable. 

Aganagic, Marino and Vafa introduced an interesting way to attack this computation 
for local toric Fano surfaces [5]. Iqbal found a nice reformulation of these results [80] 
and Zhou gave a mathematical proof of the results. This Aganagic, Marino and Vafa 
paper led to a breakthrough by Aganagic, Klemm, Marino and Vafa in [3], where an 
effective algorithm was offered that produces explicit combinatorial answers (without 
Hodge integrals, etc) for all toric Calabi-Yau threefolds. The idea is that any toric 
Calabi-Yau threefold (which is necessarily non-compact) can be presented by a labeled 
planar trivalent graph that can be cut into trivalent vertices 'with legs' representing C 3 
patches. Labels on the edges provide the gluing data that specifies the threefold. The 
topological vertex is an explicit function ^j(A) of the edge labels at each vertex, three 
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partitions (/j}, /x 2 , fi 3 ) and three integers (n 1 , n 2 , n 3 ) associated to each vertex of the 
graph. The generating function of the Gromov-Witten invariants of the threefold can 
then be written as a 'state sum' of these Cng(X) taken over additional labelings. The 
authors of [3] provided an explicit combinatorial expression for CW(A) based on a 
derivation assuming large N duality. 

This algorithm has been almost proved mathematically by Li, Liu, Liu and Zhou [95] 
based on gluing formulas for relative Gromov-Witten invariants. However, mathematical 
redefinition leads to a seemingly different expression Cn #(A) for the topological vertex. 
The equality 

has been verified for the case when one of the partitions fj,' is empty or when all 
partitions have length < 6 but the general case remains open. 

Another interesting class of Calabi-Yau threefolds is given by local curves. Those are 
the total spaces of rank 2 complex vector bundles N over a complex curve £ with 
ci(N) = 2g(£) — 2, for example the resolved conifold is a local CP 1 . Although not 
toric in general these threefolds always admit 'degenerate' holomorphic torus actions 
that leave the entire zero section fixed (as opposed to isolated points in the usual case). 
In [36] J Bryan and R Pandharipande used relative Gromov-Witten invariants and the 
TQFT approach to construct a recursive algorithm that computes the invariants of any 
local curve. 

Next in complexity is the case of local surfaces, that is, total spaces of canonical bundles 
Ks to complex surfaces S. The work of D-E Diaconescu, B Florea and others on the 
invariants of local del Pezzo surfaces [46] culminated in the joint work with N Saulina 
[48] that extends the toric topological algorithm to the case of local ruled surfaces with a 
finite number of reducible fibers. As in the case of local curves the authors make use of 
degenerate torus actions that fix finitely many curves and augment the toric formalism 
by the corresponding correction terms. The derivation of the combinatorial formulae 
uses physical arguments as in [3] and mathematical justification of the ruled vertex is 
an open problem. Another open problem is to generalize this algorithm to arbitrary 
Calabi-Yau threefolds with degenerate torus actions. 

From the point of view of Large N Duality it also important to understand the pre-duals 
of the above threefolds, that is, analogs of T*S 3 for the resolved conifold and identify 
the correct pre-dual theories. In the known examples as originally considered by 
Aganagic and Vafa [7] several 2-cycles are collapsed and then replaced by Lagrangian 
3 -cycles via resolving a singular deformation. These pre-duals are thus of a more 
general form than T*M [5, 46]. The corresponding theories combine elements of both 
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the Chern-Simons and the Gromov-Witten theories in agreement with Witten's original 
idea that the Chern-Simons theory on M is the correct 'Gromov-Witten theory' on 
T*M (see Witten [161] and Grassi and Rossi [67, Appendix 9]). It was the formalism 
from [5] that led to the discovery of the topological vertex. 

20.2 Intermediate theories 

In this section we discuss some theories that have recently emerged and could provide a 
bridge between the two sides of the large N duality. The most developed of these is the 
Donaldson-Thomas theory that has already been used to prove some of the duality's 
structural predictions. 

Gopakumar and Vafa predicted in [65] that the properly normalized partition function 
of the Gromov-Witten invariants Zx(\, v) on a Calabi-Yau threefold X is a rational 
function of the variable q = —e' x and expands into a series in q with integral coefficients 
(BPS states or Gopakumar-Vafa invariants). Intuitively the integers should correspond 
to counts of embedded curves in X. Classically, embedded curves are described by 
ideal sheaves on X, that is, torsion-free rank one sheaves with trivial determinants. S 
Donaldson and R Thomas introduced in [50] a new class of invariants N Xi p that count 
the number of ideal sheaves with a given holomorphic Euler characteristic x an d the 
associated curve class f3. The Donaldson-Thomas theory is 'better' than the Gromov- 
Witten theory in the sense that no orbifolds occur as moduli spaces and the numbers N Xj p 
are integers. In [107, 108] D Maulik, N Nekrasov, A Okounkov and R Pandharipande 
conjectured that the Donaldson-Thomas partition function Zx(q, v) = Yl x p ^x,P^ Xv 
turns into Zx(\, v) after the change of variables q = —e lX . This automatically implies 
the rationality and integrality predictions for all Calabi-Yau threefolds. Moreover, the 
authors offered the 'equivariant vertex' algorithm analogous to the topological vertex for 
computing the Donaldson-Thomas invariants. Combined with [95] and [36] their result 
proves the Gromov-Witten/Donaldson-Thomas duality for toric Calabi-Yau threefolds 
and local curves respectively. Obviously it is desirable to extend the duality to local 
surfaces and more general threefolds. 

It is expected that the Donaldson-Thomas theory admits a gauge-theoretic interpretation 
and if so it could serve as a link in a chain mathematically connecting Gromov-Witten 
theory to Chern-Simons theory. In particular there are some promising connections 
discovered between the BPS states and Yang-Mills theory in two dimensions [6] and in 
four dimensions [120]. 

Another possible intermediary is the symplectic field theory (SFT) of Y Eliashberg, A 
Givental and H Hofer [53]. In general SFT studies invariants of a contact manifold 
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C by considering moduli of pseudoholomorphic curves in its symplectization Cxi. 
One can naturally associate a contact manifold to any 3 -dimensional manifold M, 
namely the cosphere bundle S(T*M) with symplectization S(T*M) x R ~ T*M\M. 
Unlike T*M itself T*M\M does admit nontrivial pseudoholomorphic curves that may 
serve as Witten's 'instantons at infinity' [161]. Another attractive trait of SFT is that 
it reconstructs some knot invariants, for example the Alexander polynomial, from the 
Gromov-Witten invariants (see L Ng's paper [117] in this volume). The main challenge 
in applying SFT to Large N Duality is the scarcity of effective algorithms for computing 
the invariants. 

We should also mention an older approach to proving large N duality suggested by B 
Acharya [1] and developed by M Atiyah, J Maldacena and C Vafa [16] (see also Grassi 
and Rossi [67]) by lifting both sides to the M-theory on a 7-dimensional manifold 
with G2 holonomy. However, so far the M-theory approach (M for mystery) has not 
been very fruitful mathematically because the geometry of G2 manifolds is much less 
understood than that of Calabi-Yau threefolds. 

20.3 Construction of large N duals 

There are very few known large N dual pairs despite the large number of known 
Calabi-Yau threefolds with computable Gromov-Witten invariants. For a while after 
1998 the original T*S 3 /0(-l) © O(-l) example remained the only one. In 2001 F 
Cachazo, K Intriligator and C Vafa constructed a family of examples [37] with the 
deformed conifold T*S 3 replaced by the following hypersurface in C 4 : 

W\xf +f(x) + y 2 + z 2 + w 2 = 0. 

Here W(x) , f(x) are polynomials of degrees n, n — 2 respectively and the deformed 
conifold is recovered for W(x) = x 2 /2,f = const. However, they do not provide new 
examples of the form T*M, which is where it is easiest to compute the Chern-Simons 
side. In fact, the only known examples of this form come from the spherical quotients. 

The first published version of Large N Duality for the lens spaces L(p, 1) = S 3 /l* p 
appeared in Giveon, Kehagias and Partouche[64] (see also Marino [4] for other credits). 
The idea is to extend the group F = "L p action to T*S 3 , pull it through the transition 
to 0(— 1) © 0(— 1) and then resolve the resulting quotient singularity (see Halmagyi, 
Okuda and Yasnov [70] for details). The dual quotient is a bundle over CP 1 fibered 
by surface singularities C 2 /F and one can obtain a threefold resolution by resolving 
the surface singularities in each fiber. For the resolved quotient to be Calabi-Yau the 
resolution must be crepant (see Harris [73]). This restricts the list of groups F to finite 
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subgroups of SU(2) (in particular this is why the lens spaces L(p, q) with q ^ 1 are 
out). The geometry of the transition for such quotients and more general fibrations 
of surface ADE-singularities that also include the [37] examples is studied in [38]. 
One may be able to lift the SU(2) restriction by considering orbifold Gromov-Witten 
invariants. The crepant resolution conjecture proved for the affine ADE-singularities 
by F Perroni [123] says roughly that the orbifold Gromov-Witten invariants of a variety 
are equal to the ordinary Gromov-Witten invariants of its crepant resolution (when such 
exists). Thus conjecturally Large N Duality for the quotients with Y C SO(4) should 
relate the Chern-Simons invariants of S 3 /Y to the orbifold Gromov-Witten invariants 

of C(-i)eO(-i)/r. 

Computationally only the L(p, 1) example has been considered so far [4, 70]. Here the 
Chern-Simons partition function is known (see for example Hansen and Takata [71], 
Garoufalidis and Marino [63] and Turaev [152]) and the Gromov-Witten free energy is 
computable via the topological vertex since the dual is toric. The duality statement is 
more complicated than in the S 3 case and one has to split the Chern-Simons partition 
function into contributions from different (gauge classes of) flat connections before 
comparing to the Gromov-Witten side [4]. In addition to the analysis in [4], there is an 
indirect check of the duality based on mirror symmetry in [70]. In general, the precise 
meaning of the duality for more general manifolds remains an open question. 

This justifies interest in explicit computations of the LMO invariant (see Le, Murakami 
and Ohtsuki [94] and Bar-Natan, Garoufalidis, Rozansky and Thurston [26]) that is 
believed to capture the contribution of the trivial connection into the full Chern-Simons 
partition function. M Marino showed in [105] using physical considerations related 
to mirror symmetry that the LMO invariant can be expressed as a perturbed Gaussian 
matrix integral for Seifert fiber spaces. This reduces the computation to a solvable 
matrix model (see Aganagic, Klemm, Marino and Vafa [4], Fiorenza and Murri [57] and 
Marino [106]). The paper [105] also presented some sample computations. Recently S 
Garoufalidis and M Marino [63] gave a mathematical derivation of the matrix model for 
general rational homology spheres based on the Erhus integral presentation of the LMO 
invariant [26]. An interesting open question is just how much of the Gromov-Witten 
theory on the dual can be recovered from the LMO invariant. 

There is also a duality involving SO(N) or Sp(N) Chern-Simons theories discussed in 
the physics literature [145, 47, 31, 32]. 

Finally we mention the symplectic surgery approach of I Smith and R Thomas to 
constructing large N duals [146]. Their work suggests that in many cases such duals 
ought to be 'non-Kahler Calabi-Yaus', that is, non-Kahler symplectic manifolds X with 
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c\{X) = 0. If so, this explains why so few duals to cotangent bundles are known despite 
the abundance of known Kahler Calabi-Yau threefolds. 
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Part IV 

Appendices 

A Stacks 

In this section we follow the excellent exposition covering stacks in Metzler [1 10], and 
just add a couple of motivating examples. Stacks were introduced to encode the structure 
of an orbifold in the category of schemes, but may also be used to define orbifolds 
in the smooth, topological and analytic categories. We will provide examples in the 
smooth category. The main geometric objects that we consider may be represented by a 
category with additional structure, a contravariant functor or a covariant functor. We 
will conclude this section with a list of properties that such a covariant functor satisfies 
if and only if it comes from an object of the category in a natural way. The generalized 
objects that are used in Gromov-Witten theory are just covariant functors that satisfy a 
subset of these properties. 

Recall that an orbifold is a space locally modeled on the quotient of W 1 by a finite 
group action together with additional data to measure the stabilizer subgroups. As a first 
example, consider the quotient of C by the natural left action of Z3 by multiplication 
by cube roots of unity. We can encode this as a category X with objects Ob(X) := C 
and arrows Ar(X) := Z3 x C where we consider (to, z) € Mor(z, loz). The underlying 
space of the associated orbifold is the quotient of the objects obtained by identifying 
those objects connected by a morphism. The stabilizer group of a point z £ Ob(X) is 
just Mor(z, z). This is a special category because every morphism has an inverse. Such 
a category is called a groupoid. In fact this has the structure of a smooth groupoid. 
The structure maps in this example are given by (source - s : Ar(X) — > Ob(X) , 
s(lo,z) = z; target - ?: Ar(X) -> Ob(X), t(co,z) = loz; inverse - i: Ar(X) ->■ Ar(X), 
i(io,z) = (lo~ 1 ,loz); composition - m: Mor(u;z, 6 loz) x Mor(z,Loz) — > Mor(z, 6 loz), 
m(f,g) = f°g) 

Definition A.l A smooth groupoid is a category X with invertible morphisms such 
that Ob(X) and Ar(X) are smooth manifolds and the various structure maps are smooth. 

One good example to keep in mind is the smooth groupoid associated to any atlas on a 
smooth manifold. Given an atlas, A = {ip a : U a — ► V a } define a smooth groupoid 
with Ob(X A ) := \} a V a and Ar(X A ) := ]} a „ tp a (U a n Up) with the obvious structure 
maps and smooth structures. 
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Exercise A.2 Combine the example of the Z3 quotient of C with the smooth groupoid 
associated to an atlas to define a smooth groupoid modeling an orbifold with underlying 
space homeomorphic to S 2 , one point with stabilizer Z3 , one point with stabilizer Z2 
and the rest of the points having trivial stabilizer. 

We now turn to the second way to encode a geometric object - a contravariant functor. 
Let DIFF be the category of smooth manifolds and SET be the category of sets. Given 
a smooth manifold M, we define a contravariant functor M: DIFF ==> SET by 
M(N) := C°°(N,M) &ndM(f: N -► P) :=f* : C°°(P,M) -»• C°°(N,M). It is possible 
to reconstruct the original manifold (up to diffeomorphism) from the associated functor. 
We therefore think of a contravariant functor M. : DIFF =>• SET as a generalized 
manifold. (Recall that we described the moduli stack as a contravariant functor from 
SCHEME to SET.) However, we will have to add some restrictions in order to have a 
reasonable family of generalizations. When we add these restrictions to give the formal 
definition of a stack we will use a third description of geometric objects. This third 
description will generalize the first two frameworks. 

Given a contravariant functor M : DIFF =>• SET one can define a new category 
T> M with 

Ob(D- M ) := U N M(N) 
and Mor((a, AO, (/?,/>)) := {a £ C°°(N,P)\M(a)(/3) = a}. 

One then defines a covariant functor F M : D M ==>• DIFF by F M (a: (a,N) — > 
(J3, P)) := a : N — > P. When the contravariant functor is of the form M the associated 
covariant functor will satisfy a number of special properties. The first property that it 
will satisfy is that it will be a fibered category. 

Definition A.3 A fibered category over C is a covariant functor F : D =>■ C such 
that 

(1) For every/: Q> —> C\ € Ar(C) and every D\ such that F{D\) = C\ there is an 
arrow g : D -> D { such that F(g : D -> D{) =/: C -> C x . 

(2) If F(gi) of = F(go), then there is a unique g G Ar(D) such that F(g) =f. 
Exercise A.4 Check that F- : D^ =>• DIFF is a fibered category. 

It is also possible to construct a fibered category associated to a smooth groupoid. Let 
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X be a smooth groupoid and define a category X D with 
Ob(*D):= [ [ C°°(N, Ob(X)), 

/VeOb(DIFF) 

Motif: N -> Ob(X),g: P -» Ob(X)) := 

^ >P,h: N ► Ar(X)) | s{h(p)) =f(p) and ^(p)) = g(<p(p))}. 

The associated covariant functor is given by, 

x F((p : N ^P,h: N —* Ar(X)) := <p: N P . 

Exercise A.5 Check that X F: X D =>• DIFF is a fibered category. (Recall that the 
arrows in a groupoid have inverses.) 

Let F : D ==> C be a fibered category and C be an object of C. We can define a 
fibered category over C, denoted F c , by Ob(F c ) := {£> G Ob(D)| F(D) = C} and 
Ar(Fc) := : Dq —> D\ \F{tp) = idc} - One can see that Fc is a groupoid. 

Exercise A.6 Prove that Fc is a groupoid. 

One often wishes to put additional structure on a category. A good motivating example 
is the category of open sets of a topological space with inclusions as arrows. In this 
setting, one would like to axiomatize the properties of open covers of the original space. 
This leads to the notion of a Grothendieck topology and the notion of a site. Once a 
category has a notion of coverings one can define an analogue of a sheaf. This is one 
way to introduce stacks. We do not need the definitions of Grothendieck topologies or 
sites, but we do use the following definition that we quote from Metzler [1 10] to encode 
the notion of a covering. 

Definition A.7 A basis for a Grothendieck topology on a category C is a function K 
which assigns to every object C of C a collection K(C) of families of arrows with 
target C, called covering families, such that 

(1) iff: C — > C is an isomorphism, then {/} is a covering family; 

(2) (stability) if {f : C ; - — > C} is a covering family, then for any arrow g : D — > C, 
the pullbacks C,- Xc D exist and the family of pullbacks C, x c D — > D is a 
covering family (of D); 

(3) (transitivity) if {/j : C; — ► C | i € /} is a covering family and for each / £ /, one 
has a covering family {g,y: — ► C,- |y G /,}, then the family of composites 
{figij : Dij -> C | i € /J G /,} is a covering family. 
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In addition to the usual notion of an open cover in the category of open sets, we obtain 
an example of a basis for a Grothendieck topology on DIFF by considering collections 
of open embeddings whose images cover a given manifold. This example will be used 
in our definition of prestack and stack. 

We now have two different constructions of special fibered categories. One starts with a 
manifold and passes to an associated contravariant functor and then to the associated 
fibered category. The second passes from a manifold to a smooth groupoid to the 
associated fibered category. The fibered categories constructed in either of these ways 
satisfy additional conditions summarized in the definition of a prestack. We take the 
characterization of a prestack given in [1 10, Lemma 25] as the definition of a prestack. 



Definition A.8 A fibered category F : C D is a prestack if and only if there is a 
unique arrow ip : x — * y filling in the dotted arrow in every diagram of the following 
form with F(ip) = 1 . In this diagram C G Ob(C), {C a — > C} is a cover of C and x,y 
are objects of D that map to C and x a G D map to C a under F. In addition we denote 
fibered products as C a p := C a x c Cp (x a p). 





Exercise A.9 Check that F- and X F are prestacks. 



We finally come to the definition of a stack. The definition we give is not the usual one. 
It is instead the characterization given in Metzler [110, Lemma 32]. We chose to use 
this as the definition because it was the quickest way to a clean definition. For the more 
typical description of a stack in terms of descent data see the full exposition in [110] or 
Behrend [28]. 



Definition A.10 A prestack F : D ==>■ C is a stack if and only if for every cover 
{C a — > C} in C and x a mapping to C a satisfying the following commutative diagram 
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for all index triples 

x af3 >■ X a C a f3 >■ C a 

y X F 

X a py^X aj Xfl — s, C Q /3 7 C aj Cp 
"X X XX 

xp-y ^ Xy Cj3^ s- 

there is an object x in D mapping to C and arrows {x a — ► x} filling in the commutative 
diagram 

■^a C a 

S \ xx 

X / X / 

Exercise A.ll Prove that F- is a stack. 

One may be expecting an exercise to prove that X F is a stack. However, the fibered 
category associated to a typical groupoid is not a stack. A good example to consider 
is the groupoid that arises from the standard two-chart atlas of CP 1 . In this case 
the smooth groupoid has Ob(X) = C]JC corresponding to the two charts and 
Ai(X) = C\JC\JC X \JC X corresponding to the overlaps. The objects of the domain 
category of the associated fibered category are smooth maps from smooth manifolds 
into Ob(X). The problem arises when one considers a map into CP 1 with image that 
is not contained in one of the coordinate charts, for example x : C — > CP 1 given by 
x(z) := [z — 1 : z + 1] . By restricting to the charts and their overlaps we obtain objects 
of the domain of the fibered category x±: C — {=Fl}^ CP 1 given by x±(z) '■= - 
These objects map to the charts under the fibered category and the charts form a cover 
of CP 1 . The pull-backs of the cover and the x± satisfy the diagram in the hypothesis of 
the definition of a stack but not the required extension property. It is possible to stackify 
a prestack by declaring objects to be equivalence classes of objects over elements of 
covers as in the hypothesis of the definition. This is similar to the sheafification of a 
presheaf. See Metzler [1 10] for details. When the prestack associated to the groupoid 
associated to an atlas is stackified, the domain category has objects that correspond 
exactly to smooth maps into the given manifold. Thus the constructions of X F and F- 
agree when one starts with a smooth manifold and stackifies. 

Exercise A.12 Prove that the pull-backs of the cover and the x± satisfy the diagram in 
the hypothesis of the definition of a stack. 
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We now need to consider maps of stacks. As motivation consider what can be constructed 
between the stacks associated to a pair of manifolds from a map between the manifolds, 
/ : M —> N '. Recall that the objects of the domain category of the stack associated to M 
are just maps a : P —> M. Such a map can be taken to/o«: P —>■ N, which is an 
object of the domain category associated to N. This extends to arrows in the natural 
way to give a covariant functor Af : =>• D^ . This motivates the definition of a 
map between stacks given below. 

Definition A.13 A map between stacks F : D =>- C and G : E =>- C say 
A: F — > G is just a covariant functor A : D ==> E such that F = G o A . 

There are some technical issues that arise when one wishes to define isomorphism of 
stacks, which are best addressed with 2-categories, see [1 10]. 

We now quote some definitions of some properties of stacks and maps of stacks from 
[110]. 

Definition A.14 Let A: F' — > F be a map of fibered categories over C. We say A is 
a monomorphism if, for every object C of C, the functor Ac'. F' c —> Fq on fibers is 
fully faithful. 

Definition A.15 We say a map of fibered categories A: F' — > F is covering (French 
'couvrant') if, for every object C of C, and every object D of Fq, there is a covering 
family {fi : Q — ► C} and for every / an object D, of F^. such that A(D,) = D|c, - 

If F' and F are stacks, then we refer to a covering map as an epimorphism. 

For the next definitions we need the definition of the fibered product or pull-back of a pair 
of maps of stacks. Given A: F — > H and B: G — > H one defines the pull-back stack 
F x H G to be the stack with domain category having objects Ob(Df ) Xob(D«) Ob(Dc), 
that is, ordered pairs (x,y) such that A(x) = B(y). We can generalize the construction 
of a contravariant functor M associated to a smooth manifold to any object C in any 
category C . If the category is reasonably well behaved the associated covariant functor 
F- will be a stack. 

The following definition generalizes the notions of manifold and a submersion between 
smooth manifolds. 

Definition A.16 A stack F is representable if and only if it is isomorphic to a stack of 
the form F- for some object C of the base category C. Let F, G be stacks over C. 
We say that a map A : F — ► G is representable if for every object C of C and map 
B : F- — ► G , the pull-back stack F- xq F is representable. 
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In fact if / : M — > N is a smooth map, the associated map of stacks Af : F- — ► F- is 
representable if and only if / is a submersion, see Metzler [110]. 

Definition A. 17 Let F be a stack over C. We say F is locally representable if there is 
an object C of C and a representable epimorphism A: F c — > F. 

Definition A.18 Let F be a property of maps in C that is stable under pullback. We 
say that a representable map A : F — > G has property F if, for every object C of C and 
map B : F- — > G, the projection F*A : F- Xq F — > C has property F. 

We now come to the result characterizing representable stacks, see [1 10]. 

Theorem A.19 A stack F is equivalent to a stack of the form F- if and only if 

Al The stack F is locally representable by a map A : F- — > F. 

A2 The map A : F — > F x F is proper. 

DM The map A is etaie. 

Rl The stack has trivial automorphisms. 

R2 The map A : F —> F x F is a closed embedding. 

We now come to the definition of a Deligne-Mumford stack, and the conclusion of this 
appendix. 

Definition A.20 An Artin stack is a stack that satisfies Al and A2. A Deligne- 
Mumford stack is an Artin stack that satisfies DM (usually assumed to be over the 
category SCHEME). An orbifold is a Deligne-Mumford stack over the category 
DIFF. 

Exercise A.21 Prove that the example of the orbifold with underlying space S 2 and 
two non trivial stabilizers Z3 and Z2 really is an orbifold. Analyze the automorphisms 
of this stack. 

B Graph contributions to N2 

In this appendix we list the contributions of all of the fixed point components to the 
localization computation of N2 ■ We label graphs according to the conventions depicted 
in Figure 7.2. The first contribution is: 

7(01) = — 32(ao — ai) _2 (ao — «2) 4 (ai — a2) 1 (oo + «i — 2a2) 1 ■ 
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The contribution from 7(02) can be obtained by exchanging 1 and 2 in the above 
expression. The next contribution is: 

1/(010) = 8(a - ai)~ 2 (a - a 2 ) 3 (ai - a 2 T l ■ 

The next is: 

7/(101) = 8(ao — ol\) (ao — a 2 ) (a\ — a 2 ) 

The contributions from 77(020) and 77(202) can be obtained by exchanging 1 and 2 in 
the above expressions. The next contribution is: 

77(012) = (ao — ai) _1 (ao — a2) 3 (ai — a2) ! (2ai — ao — a 2 )~ l ■ 

The final contribution is: 

77(102) = — (ao — ai) ! (ao — a2)~ 1 (ai — a2) _2 (2ao — ol\ — a 2 ) 4 . 

A tedious simplification gives 

N 2 =7(01) +7(02) + 77(010) + 77(020) + 77(101) + 77(202) + 77(012) + 77(102) = 1 . 
This is a remarkable check of the localization formula. 



C Quantum invariants from skein theory 



When Jones introduced his polynomial invariant, he was motivated by representations of 
the braid group arising from operator algebras. Shortly thereafter the theory of quantum 
groups stalled taking off, and Reshetikhin and Turaev stalled work on defining link 
and 3-manifold invariants based on quantum groups. The papers of Witten provided 
additional inspiration and helped them devise their invariants. Since link invariants only 
have to be invariant under Reidemeister but not Kirby moves, a modular structure is not 
required and one can get by using ribbon categories that are much easier to construct. 
For instance, type I representations of f/ 9 (sljvC) with q a free variable rather than a 
fixed number form a ribbon category. We denote the resulting colored link invariants by 
Wf* , where A is a collection of representations, one for each component of the link. 
We should note that mathematicians usually label these invariants by the Lie algebra 
as we do while physicists label the invariants by the corresponding Lie group. There 
is an easier way suggested by H Wenzl [158] to get the same invariant using skein 
relations. Even though this definition is easy to understand from first principles it is next 
to impossible to compute with. In this appendix we first review Wenzl's elementary 
construction and then indicate how the same invariant can be obtained from quantum 
groups. 
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The starting point for Wenzl's definition is the THOMFLYP polynomial. (This 
generalization of the Jones polynomial was simultaneously discovered by several 
different authors and described in a joint paper [60]. The first acronym was HOMFLY 
and this is still in common use. It has since been realized that two other authors also 
deserve credit, so some people append PT to get HOMFLYPT. We prefer to use the 
pseudonym.) This polynomial invariant of links is defined by the recurrence relation 

\h(L + ) - \-*P(L-) = (qs - q-3)P(Lo), 
together with the normalization 

■P(unknot) = — r , 

qs — q~2 

where L± and Lo are three links that differ exactly in the neighborhood of one crossing 
as in Figure C. 1 . The THOMFLYP polynomial is not in fact a polynomial - rather it is a 




Figure C.l: Terms in the skein relation 

rational function of the formal variables qs and As . The Jones polynomial is recovered 
by substituting A = q 2 into the THOMFLYP polynomial. 

Working backwards from Definition 17.68, it is clear that the framed link invariant 
should be defined by 

(91) <L, n (£) = e^^U >^V(L) , 

where ny(L) are the entries of the linking matrix. By inserting a right-handed twist (the 
configuration labeled by 8y in Figure 16.3) into an arbitrary link diagram to get L + and 
a left-handed twist to get L , one can compute that 

V(W) = (q 1/2 - q~ 1/2 )(A 1/2 - A -1/2 ) -1 TO 

where LO is obtained from L by adjoining a completely unlinked and unknotted 
component. The physics literature often uses a slightly different normalization. Namely, 

<5(i) = A E - E -'"' <L) HD, 

Exercise C.l Assume zero framings (self-linking numbers). Compute 

W*f>m Hopf link) = ( X \~ X ~j ) + A -i _ i . 
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Compute W^ u n (yv) (right Hopf link) and W^ (N \\eft (2,3) torus knot). (The left Hopf 
link is depicted in Figure 16.2) 

The answer to the left (2, 3) torus knot may be found in Marino [103]. 

There is a different invariant of framed links that may be constructed out of The 
THOMFLYP polynomial. It is 

P a {L) := a E '='^='"*P(L). 



To go further, we specialize by evaluating these polynomial invariants at q 2 =e m /(N+k) ^ 
\2 =e ' N7T /( N + k ) . Let 7 denote a multi-index of length c(L), that is, a c(L)-tuple of 
positive integers. We will let I7I denote the sum of the components of 7, and define 
L 7 to be the framed link obtained from L by replacing the p th component of L by 7 P 
parallel copies. Define a number by 

(\ /\ L 2-\- L 2\~ p 
T := - lim — i r V{{\ -framed unknot^) 



Finally, Wenzl defines an invariant by 

T fW(M,0) := 



(-p \ cr(L M )-c(L)+\ 
w\) P x ™L p ~ ciL)TC{L) E e " l7l ^( L M), 

ina\|- 1 /> 



where 



e 



A5 - \-2 



The advantage of this definition is that all of the ingredients are elementary. The 
disadvantages are that one has to work to prove that it is well-defined, and it is not 
obvious how one can compute based on this definition. Notice that the 1 -framed unknot 
squared is just the right Hopf link, and then try to compute V((l -framed unknot) 3 ). 
This is fairly difficult. Computing the invariant for higher cables without some trick 
looks hopeless. Wenzl proves that this invariant is indeed well-defined and is equal to 
the Reshetikhin-Turaev invariant (see Wenzl [158]). 

To prove that Wenzl's invariant is equal to the quantum group invariant one notices 
that Q q x ® Qft = ^2 ve iN^ Ql, where N% are constants determined by the structure 
of a modular category. Via repeated application of this formula one may replace 
computations in arbitrary representations by computations for various cables in the 
fundamental representation and then apply the skein relation. 
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D Representation theory of Lie groups and Lie algebras 

This appendix reviews some aspects of classical representation theory that are used in 
this paper. The information we summarize here can be found in the wonderful books by 
Fulton and Harris [62] and Humphreys [79]. We will denote the N x N unitary matrices 
(A^A = /) by U(N). The special unitary matrices are those with unit determinant, 
this group will be denoted by S\J(N). The Lie algebras of these two groups are the 
algebra of Hermitian matrices (A^ + A = 0), denoted un, and trace-free Hermitian 
matrices, denoted by sun , respectively with the standard bracket ( [A , B] = AB — BA ) as 
a product. The complexifications of these two algebras are the collection of all complex 
matrices (fl^C) and the subalgebra of trace-free matrices (st/vC) respectively. Let Ey 
denote the matrix with a 1 in the / h column of the i th row and zeros elsewhere. 

Young diagrams and irreducible representations 

A representation of a group (algebra) is a homomorphism into the automorphisms 
(endomorphisms) of a vector space. All of the groups and algebras defined above admit 
an obvious representation with vector space C N called the fundamental or defining 
representation. 

Remark D.l Of course when we talk about differentiating a representation we are 
talking about a representation of a Lie group. Representations of groups and algebras are 
closely related but there is not an exact correspondence. For example, the fundamental 
representation of GL^C can be differentiated to produce a representation of the algebra 
qI n C. The same representation can be conjugated to give a different representation 
of GLjvC; however, differentiating the conjugate representation will not produce a 
corresponding algebra representation because it will not be complex linear. In the 
other direction one can see that there is a two-complex dimensional representation 
of the Lie algebra S03 via the isomorphism SU2 ■ However there is no corresponding 
representation of the three-dimensional rotation group SO (3). The classical groups 
that we are considering are all matrix groups, so each may be viewed as a subset 
of C N ~ via the fundamental representation. The automorphisms of any complex 
vector space can be described as a subset of complex /it-space in the same way. It 
therefore makes sense to consider polynomial representations, that is, those that can 
be described by polynomials. For simply-connected matrix groups there is a perfect 
correspondence between finite-dimensional polynomial representations of the group 
and finite dimensional representations of the Lie algebra given by differentiation. 



Qeometry & Topology Monographs 8 (2006) 



420 



Dave Auckly and Sergiy Koshkin 



There are several obvious ways to construct new representations from (sets of) old 
representations: the dual, direct sum, tensor product etc. If pk- G — > Aut(V^) for 
k = 1,2 are group representations, then the tensor product representation p\ 
p 2 : G -> Aut(Vi <8) V 2 ) is given by (pi <g) /3 2 )(g)(x y) = pi(g)(x) <g> pi(g)(y). The 
tensor product of algebra representations ^ : g — > End( Vjt) for £ = 1 , 2 is given by 
(Ml ® /^X^X* <8> y) = /ii(A)(x) <g> y + x <g> /U2(A)(y). This is obtained by differentiating 
the tensor product of group representations. 

Additional representations can be constructed by symmetric or anti-symmetric tensors. 
All of the symmetries that we need can by constructed using Young symmetrizers arising 
from Young diagrams. A typical Young diagram is displayed on the left in Figure D.l. 

A Young diagram represents a partition of a positive integer, £ = £\ + £% H h with 

£j > £j+i ■ It is standard to denote such a partition by A and we use the same notation 
for the Young diagrams as well. The Young diagram in Figure D.l corresponds to the 
partition 6 = 4 + 2. A Young tableau is a Young diagram filled in with natural numbers 
according to some rules. Young diagrams describe vector spaces of representations 
while Young tableau describe special basis vectors in those spaces, namely weight 
vectors. The Young tableau describing what we will later define as the highest weight 
associated to a Young diagram is obtained by filling in a diagram in a specific manner 
(top row with ones, second row with two's, etc) starting in the upper left corner and 
ending in the lower right one (see the right side of Figure D. 1). 



1 


1 


1 


1 


2 


2 





Figure D.l: Young diagram 

The point is that a Young diagram encodes an endomorphism of the ^-fold tensor 
product of any vector space. Construct a specific Young tableau by filling the Young 
diagram with the numbers 1 through £ filling in rows starting in the upper left. If A is a 
Young diagram, a\ will denote the endomorphism of the tensor product that takes a 
tensor product of vectors to the sum of the permutated tensor products by permutations 
preserving the numbers in the rows of the Young tableau. Another endomorphism, 
b\ , is defined analogously but with the alternating sum (according to the signs of 
permutations) over permutations that preserve numbers in the columns of the Young 
tableau. In the example from the figure, 

^(^1,2,3,4,5,6) = ^1,2,3,4,5,6 ~ ^5,2,3,4,1,6 ~ ^1,6,3,4,5,2 + ^5,6,3,4,1,2, 
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where £1,2,3,4,5,6 denotes e\ e2 <8> £3 <8> «4 <8> £5 <8> «6 and is a basis for V. The 
map a\ can be computed similarly; it is a sum of 48 terms. 

The Young symmetrizer is the composition of these two endomorphisms c\ = a\ o b\ 
(a sum of 192 terms in our example). The image of a Young symmetrizer c\ applied 
to a tensor product of £ copies of the fundamental representation is a denoted by V\ . 
Computing a basis for V\ directly from the definition is a little cumbersome. We will 
first consider two special cases corresponding to a diagram with one row or one column. 

Example D.2 For a diagram with just one row, the partition is just 1 = 1. The a\ 
endomorphism is just the sum over all permutations and the b\ is the identity map. It 
follows that Ve=e is just the symmetric product of I copies of ( Sym^C^). Similarly, 
for a diagram with just one column the partition is just I = 1 + 1 + • • • + 1 . This time 

a\ is trivial and b\ is just the alternating sum over all permutations so V^=i + n h i is 

just the £-th exterior power (f\ e C N ). 

Exercise D.3 Define the natural action of any of the classical matrix groups or algebras 
on the set of homogeneous polynomials of degree £. Show that this representation is 
isomorphic to Syn/C^. 

Remark D.4 Whereas the £-th symmetric power is nontrivial for all £, the £-th 
exterior power is trivial unless £ < N. Furthermore, the group representation on A N C N 
is given by the determinant, so the corresponding SU(N) representation is trivial unless 
£ < N. Likewise the algebra representation on the top exterior power is given by the 
trace so the su^ and st^C representations are trivial unless £ < N. The same comments 
hold for any Young diagram: The GLatC, gl^C and \J(N) representations are trivial 
unless the diagram has less than or equal to N rows and the SL^C, sl^C and SU(A^) 
representations are trivial unless the diagram has strictly less than N rows. 

When A has just one row, the set of vectors represented by all possible ways of filling 
in the Young diagram with a non-decreasing sequence of integers between 1 and N 
inclusive is a basis for V\ . When A has just one column, the set of vectors represented 
by all possible ways of filling in the Young diagram with an increasing sequence of 
integers between 1 and N inclusive is a basis for V\ . In general, V\ has a basis 
consisting of all ways of filling the Young diagram with a sequence of integers between 
1 and N inclusive so that the numbers do not decrease as one reads across rows and so 
that the numbers strictly increase as one reads down columns. This is exactly the rule 
specifying the Young tableau alluded to earlier. 
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The action of a matrix on the vector space V\ can be described easily: the matrix acts 
on the tensor product of the vectors with the given indices either using the group or 
algebra action as appropriate, the answer is fully expanded and written as a combination 
of terms in the standard non-decreasing/increasing order. 

As we go further, we will concentrate on finite-dimensional representations. A finite- 
dimensional representation is called decomposable if it can be expressed as a nontrivial 
direct sum. It is indecomposable otherwise. A representation is reducible if it contains 
a nontrivial subrepresentation and irreducible otherwise. In all of the cases that we 
consider, every finite-dimensional representation will be a direct sum of irreducible 
representations. The irreducible representations correspond to Young diagrams. It is not 
immediately clear that the representations V\ are irreducible or that every irreducible 
representation is of this form, but this is indeed the case. See the book by Fulton and 
Harris for the complete story [62]. 

To be specific irreducible polynomial representations of GL#C, Ql N C and U(N) 
are indexed by Young diagrams (partitions) with less than or equal to N rows and 
irreducible polynomial representations of SL^C, sforC and SU(iV) are indexed by 
Young diagrams (partitions) with strictly less than N rows. It is not accidental that 
irreducible representations for the different groups and algebras here are indexed by 
the same sets. The fact is that every irreducible representation of GL^C restricts to 
an irreducible representation of U(A0, which is its maximal compact subgroup and all 
of the latter are obtained in this manner. The same thing happens with SL^C, SU(^V), 
g[ w C and stjvC. 

We can encode our discussion up to this point in a definition. 

Definition D.5 Let A be a Young diagram with rows of length l\ , . . . , and columns 
of length m\ , . . . , . The associated standard tableau is obtained by filling in the 
diagram with the numbers one through £ := l\ + • • • + In across rows starting in 
the upper left. The associated tableaux are obtained by filling in the diagram with 
some numbers between one and N such that numbers are non-decreasing along rows 
and strictly increasing down columns. Let A\ = 0^ x • • • x &g N embedded as the 
subgroup of the permutation group preserving the rows of the standard tableau. 
Let B\ = <S m , x • • • x (5„ lf[ embedded as the subgroup of the permutation group 
&£ preserving the columns of the standard tableau. The associated elements of the 

group ring are a\ := Yj<jza x "> := X^eB A ( _1 ) <Jcr and tne Youn § symmetrizer 
c\ = a\ o b\. The Specht module V\ is the image of the tensor power V® 1 under the 
natural action of the Young symmetrizer. 
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This is starting to get complicated. Every irreducible representation is generated by 
what are called weight vectors. The weight vectors in one of the V\ representations are 
just the vectors represented by Young tableau. 

Example D.6 For example in the sUC representation given by V ^jn the vector 
obtained by filling the first row of the diagram with the sequence 1,1,2,3 and the 
second row with 2,4 is a weight vector. It can be written as 2^(21,1,2,3,2,4)- Let's 
see how several different elements of SI4C act on the vector from our example. The 
matrix £21 maps e\ to 22 and the rest of the vectors to 0. So our weight vector will 

get mapped to 2^(22,1,2,3,2,4) + £^(21,2,2,3,2,4) = £^(21,2,2,3,2,4). The matrix £ 14 
would replace the 4 in the second row by a 1 , but this is just the zero vector. For a more 
complicated example notice that 

£2lCgp(Cl,l,2,2,3,4) = Cgp (^2,1,2,2,3,4) + Cg=n(<?l,2,2,2,3,4) 

= 2 Cffn (21,2,2,2,3,4) + £"^(21,2,2,4,2,3) - £^(21, 2,2,3,2,4) • 
Exercise D.7 Verify that dim(y gD ) = 126. 

We will concentrate on Lie algebra representations for a while. We can see more about 
these representations once we know more about Lie algebras. The Cartan subalgebra is 
a maximal abelian (every bracket is zero) subalgebra. The Cartan subalgebra is usually 
denoted by h . 

Exercise D.8 Show that it is possible to simultaneously diagonalize a set of Hermitian 
matrices exactly when the matrices in the set commute. 

Example D.9 Using the previous exercise by changing the basis one can take the 
subset of diagonal matrices as the Cartan subalgebra. The Cartan subalgebra of gl N C is 
generated by the matrices En. For sl/yC we can take En — £ , ,+i,+i as a basis for the 
Cartan subalgebra. 

The dimension of the Cartan subalgebra is called the rank of the group or algebra. It is 
denoted by r in general and is N — 1 for sljvC. 

Just as eigenvalues are important invariants of matrices, weights are important invariants 
of a representation. 

Definition D.10 The weights of an arbitrary representation /x: q — > End(V) are 
linear functional oj : h — > C that have an associated non-zero weight vector, v u G V, 
satisfying ^i{H)v w = uilffv^ for every //eh. The set of all weight vectors (including 
zero) is called the weight space and is denoted by V w . The set of all weights of all 
representations is called the weight lattice and is denoted A w . 
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Thus we see that weights are just eigenvalues of families of operators, weight vectors 
are just the corresponding eigenvectors and weight spaces are just the corresponding 
eigenspaces. The sum of weights corresponding to the same representation may not be 
a weight of the same representation; however, it is a weight in the tensor product of the 
representation with itself. It helps to consider the weights of all possible representations 
together. These form a subgroup of the dual to the Cartan subalgebra h* . 

Exercise D.ll Prove that the set of weights is a subgroup of the dual h* . Hint: think 
about the tensor product and dual of representations. 

Exercise D.12 Verify that Young tableaux always represent weight vectors. 

Example D.13 Recall the weight vector va = Cgjn(ei, 1,2,2,3,4) from Example D.6. 
The elements of the Cartan subalgebra act as follows: E\\ — E22 multiplies our weight 
vector by zero since there are two ones and two twos, £22 — ^33 multiplies the vector 
by 1 = 2 — 1 , and £33 — £44 multiplies the vector by = 1 — 1 . Thus v\ is indeed a 
weight vector. 

It is helpful to write the weights in coordinates. Recall the standard £„ basis of the 
Cartan subalgebra of qI n C. Let E* t denote the dual basis and set 

N 

i=i 

then the Cartan subalgebra of sl^C contains exactly the matrices annihilated by /*. 
Therefore we can identify its dual h* [jv with the subspace of h* [w that annihilates /. 
Thus h* [jv is generated by E\ — E* i+lj+l . It is not at all obvious, but the weight lattices 
of gl N C, sInC are generated over Z by E* u and 



Li :=£*•- -/* 



respectively. 



Example D.14 Given this notation, the weight corresponding to the weight vector of 
our Cg^C^i, 1,2,2,3,4) example is E* n + E\ 2 - \P = L\ + L 2 . 



Exercise D.15 Assuming that the irreducible representations are exactly show that 



the weight lattice of sl^C is generated by L; := E* u — if . 



Every Lie algebra has one special representation called the adjoint representation. 
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Definition D.16 The adjoint representation is ad : g — > End(g) given by ad(X)(Y) = 
[X, Y] . The weights of the adjoint representation are called roots. Roots are typically 
denoted by a. The corresponding weight vectors are called root vectors. The set of 
roots is denoted by A and the subgroup of h* generated by the roots is called the root 
lattice and is denoted A r . 

The adjoint representation is not necessarily irreducible. If it is the algebra is called 
simple. For example, sl^C is simple and qI n C is not. 

Exercise D.17 Find the Young diagram corresponding to the adjoint representation of 
slfjC . You can check your answer by computing the dimension. 

It is standard to pick a vector in t) that is not annihilated by any non-zero weight to 
measure the heights of weights and roots. This vector can be chosen so that any weight 
evaluated on this vector gives a real number. We can take the matrix 

_ t 

Ht := tt^-^h + tt n - 2 E 22 + ---E m -/ , 

7T — 1 

where / = J2^a to define heights for sl^C 

Exercise D.18 The highest weight vector in our sample representation is then the one 
displayed in Figure D.l. 

Definition D.19 The roots that evaluate to a positive number are called the positive 
roots. The set of positive roots is denoted by A + . The positive roots that cannot be 
written as a sum of positive roots are called simple roots. 

The negative of any root is also a root. Any Lie algebra may be written as 

= h (Q^ a © 8a) , 

aeA+ 

where Q± a is the eigenspace of f) with eigenvalue ±a. 

Definition D.20 A weight vector in a representation that is annihilated by : — 
©oeA+ 0" is called a highest weight vector. The corresponding weight is called a 
highest weight. 

The good news is that this is all very explicit for sljyC 
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Example D.21 The roots of sl^C are just ag := E\ — £?■. The positive roots are 
those with i < j, and the simple roots are those with 7 = 1+1. We use the notation 
a,- = E*j — E* i+u+l for simple roots. The root vectors are eg := Eg and fg := Ep 
The subspace s[j\rC+ is generated by Eg with i < j, thus the weight vector from our 
c^p(<?i, 1,2,2,3,4) example is not a highest weight vector since it is not annihilated by 
■En- 
Remark D.22 The vector in a representation with highest weight as measured by 
the height Ht is a highest weight vector. In general the highest weights of a fixed 
representation are the weights with maximal height in each of the irreducible factors. 

The notation g_ := © ae ^+ 0- a will a l so be useful. If v\ is a highest weight vector in 
a representation, the space V\ (notice the duplicate notation from our discussion with 
Young diagrams) that is generated by (h ©g_)vA will be an irreducible subrepresentation 
and the original representation will be the direct sum of all such. This is a very powerful 
idea that we should elaborate. We begin with an example. 

Example D.23 For st^yC the space s^C_ is spanned by the matrices Eg with 
i > j. For the representation described in Example D.6 the highest weight vector 
V pjn := c^p(<?i, 1,1, 1,2,2) is clearly a weight vector. The fact that it is a highest weight 
vector allows one to reconstruct the entire representation. In order to construct a basis 
for the representation one should just start multiplying the vector by the matrices Eg 
with i > j. For example, one should add 2?2iCgjn(£i, 1,1, 1,2,2) = 4c^n(ei, 1,1,2,2,2) to 
the basis. Relations in the Lie algebra determine the action of the rest of the Lie algebra 
on this vector. The first thing that one can see is that it is a weight vector. This is 
because the bracket of any element of the Cartan subalgebra with £21 is a multiple of 
£21, for example [En — £22, £21] = 2£2i ■ This implies that 

(£ll — £22)£2lV^n = [£ll — £22,£2l]V|jp +£n(£ll — E22)v^pj 

= 2£21v ^n + 2£21v ^n . 

Using the bracket [£12, £21] = £11 —£22 one can compute £i2£2iv^p. By continuing 
in this way adding additional vectors such as £32£2iv^p and 2?2i v EEp etc - The entire 
representation may be reconstructed. 

There are two important things to remember from this example. The first is that any 
finite-dimensional representation is a direct sum of weight spaces: V = ffi^V" . The 
V w are not representations of the full Lie algebra, they are only representations of 
the Cartan subalgebra. This weight space decomposition completely determines the 
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action of the Cartan subalgebra according to /z(//)v = uj(H)v. This follows from the 
natural generalization of [En — E%%^E%{\ = 2E%\. In fact when E £ Q a , one has 

pyu yui+na 

The second thing to remember is how the entire representation could be reconstructed 
from the highest weight vectors. To formalize this define the universal enveloping 
algebra Uq to be the associative unital algebra generated by q subject to the relations 
[X, Y] = XY — YX. For sl^C this is just the matrix algebra structure. The point is 
that in a general Lie algebra one cannot multiply elements; one can only take brackets. 
The universal enveloping algebra allows one to multiply elements. Let U± be the 
subalgebras generated by f) © q± respectively. Specifying a weight vector v\ with 
weight A specifies a t/+ -module structure on C thought of as the vector space generated 
by . The formal way to extend this to a representation of the universal enveloping 
algebra (and thus to the Lie algebra) is as the £/g-module Uq ®u+ C. 

Remark D.24 We can summarize these two points by stating that every irreducible 
representation is generated by words in t/_ multiplied by a highest weight vector and 
the weight space decomposition can be used to determine which words act trivially (see 
Example 17.14 and the discussion after it). 

Summarizing the above discussion we get the following description of the irreducible 
representations in the cases of most interest to us. 

Irreducible complex representations of qI n C are indexed by Young dia- 
grams (partitions) with less than or equal to N rows and irreducible complex 
representations of sInC are indexed by Young diagrams (partitions) with 
strictly less than N rows. 

In the main body of the paper we mention that general finite-dimensional representations 
of quantum groups neither decompose into direct sums of weight spaces nor contain 
highest weight vectors. There is however a class of sub-representations called tilting 
modules that do possess these desirable properties. 

Following the ideas outlined above and in Example D.23 leads to a classification of all 
finite-dimensional representations of any Lie algebra, in particular gtyC, sforC (see 
Fulton and Harris [62]). 

Exercise D.25 Prove that every finite-dimensional representation can be decomposed 
as a sum of the V\ uniquely. 
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Dominant weights and the Weyl character formula 

As nice as the above description might be it is not convenient for generalization to 
quantum groups. We now briefly review a different (but closely related) approach due 
to E Cartan that uses dominant weights instead of Young diagrams. 

Any Lie algebra inherits a bilinear form according to 

(X, Y) = constant • Tr(ad(X) o ad(F)) . 

This pairing is called the Killing form. A Lie algebra is called semisimple exactly when 
this form is nondegenerate. 

Exercise D.26 Prove that every simple algebra, that is, one with irreducible adjoint 
representation, is semisimple. 

There are two standard ways to normalize it. It induces a form on ()* and one can 
require that (a, a) = 2 for short roots or for long roots. In the case of sInC all roots 
have the same length, so there is a standard interpretation of the Killing form. In fact 
it is just given by (A,B) = Tr(AB^). Using the Killing form we may identify the 
Cartan subalgebra with its dual. Under this identification, the coroots are defined by 
a y = 2(a, a)~ l a (so in stjvC there is no difference between the coroots and the roots.) 
The coroot lattice A v is the lattice generated by the coroots. A coroot a v is simple if 
the corresponding root a is simple. If a, are the positive simple roots then the basis 
biorthogonal to a- is denoted by uii (that is, (a/, u)j) = Sy) and its elements are called 
the fundamental weights. The sum of fundamental weights p -=J2i w i a l so plays an 
important role and is called the Weyl weight. 

Exercise D.27 Show that the fundamental weights for s[^C are given by u>k = 
£/=i L i = £i=i E t ~ N 1 * ■ That is ' ( a Y> Vj) = Sy- Also show that the Weyl weight 
can be written as p := ^ J2 a eA+ a > 

which is J2fJi\ N ~ OLi for sl N C. 

Exercise D.28 Show that are (all) the weights of the defining representation of 

sl N C. 

We denote the highest root by 9. The dual Coxeter number of a Lie algebra is 
h y = (p, 9) + 1 . For sInC the highest root is E\ x — E^ N and the dual Coxeter number 
is just N. In the main body of the paper we work only with sl^C. Thus, we often 
assume that the Cartan subalgebra is identified with its dual and drop * and v from 
notation. 
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Exercise D.29 Show that the Weyl weight is given by 

1 N 1 N 

p=-Y,W+l-2k)Et k = -Y,(N + l-2k)L k 

k=l k=\ 

for sl/yC. 

The Weyl group is the group generated by reflections in hyperplanes perpendicular to 
the roots. The reflection corresponding to a simple root is Si((3) = (3 — 2 9^ ,a A a-i ■ For 
sInC the Weyl group is the permutation group <5n- The reflections Si act on the E* kk 
via the permutation (i i+ 1). 

It is time to establish a correspondence between the positive weights of sl/vC and 
partitions (Young diagrams). Note that the matrices L; := E* t — jjl* , 1 < i < N — 1 
may also serve as a basis of A w different from the basis of fundamental weights ioi . 
These are related by, u>i = Yl)=i Lj- Given a highest weight A we have 

N-l N-l i N-l ,N-1 \ N-l 

i=i i=i 7=1 7=1 v i=7 7 7=1 

Since the are non-negative numbers we have A, > , in other words the vector 
(Ai, . . . , Aat_i) is a partition. This is the partition that corresponds to the highest weight 
A and we abuse notation by using the same letter for the weight, partition and Young 
diagram. It is possible to construct a Young diagram from a sum of fundamental weights 
directly. Namely, if A = Ylf=i men the diagram has n/v-i columns with N — I 
boxes, iin~2 columns with N — 2 boxes, etc. Since the A; are the numbers of boxes in 
the rows this provides a simple graphical method of converting sums of fundamental 
weights into partitions and vice versa. Since we already know that Young diagrams 
index irreducible complex representations of s[/yC it makes sense to distinguish their 
counterparts among weights. 

Definition D.30 A dominant weight is a linear combination of fundamental weights 
with non-negative integer coefficients. If A = Ylf=i n i u i * s a dominant weight we 
denote ^(A) := max{/| > 0} the length of A and |A| := Ylf^ Jn ; tne volume of A. 
The set of all dominant weights is denoted A+ . 

Exercise D.31 Show that the Young diagram corresponding to A has ^(A) rows and 
| A| boxes. For every non-negative weight find a vector H G f) such that |A| = (A, H). 
Hint: express H as a sum of coroots and use the biorthogonality relation. In case you 
are wondering: no, H / Ht. 
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The language of dominant weights allows one to characterize irreducible representations 
for all semisimple Lie algebras 

Any dominant weight is the highest weight of some irreducible rep- 
resentation and any highest weight is dominant. Irreducible complex 
representations of a complex semisimple Lie algebra, for example sIa?C 
are indexed by dominant weights. 

We now elaborate on this a bit. 

Definition D.32 If A is a dominant weight we let X\ be the left ideal of £/(sl#C) 
generated by e,- and a- — X(a/). The classical Verma module associated to A is 

V X := U(sl N Q/I x . 

The irreducible representation V\ that we constructed using the Young diagram A 
is isomorphic to the maximal irreducible quotient of V\ . In particular the maximal 
irreducible quotient is finite dimensional. Also every finite dimensional representation 
decomposes into a direct sum of irreducible representations. In addition the irreducible 
representations are simple. One nice proof of all of these facts uses the Weyl unitary 
trick. This goes as follows. 

Any finite dimensional representation of s^C also gives a representation of sun by 
restriction. This in turn induces a representation of SU(N) by exponentiation. It follows 
from averaging over SU(N) that any finite dimensional representation of sI^C admits 
an invariant inner product. This quickly implies that any representation is a sum of 
simple representations and that every representation has a weight. By repeated action of 
the ft on any weight vector one can find a highest weight vector. This is similar to the 
computation in example D.23. The same computation shows that any two irreducible 
representations with the same highest weight are isomorphic. 

Exercise D.33 Write out details for the facts outlined in the previous paragraph. 
Compare with example Example 17.14 and exercise Exercise 17.15 from the main text. 

It is instructive to draw a picture of the weight lattice A w and the subset of dominant 
weights inside of the dual to the Cartan subalgebra. For sl^C this dual can be canonically 
identified with the subalgebra itself via the Killing form. Under this identification the 
lattice in the background of Figure 18.3 is the weight lattice for s^C. 

Exercise D.34 Make a larger picture of the weight lattice for s^C. Given that the 
lattice points at 2/ y/3 and (2 / y/3)e m ^ (viewing the picture in the complex plane) 
correspond to the fundamental weights lui and u% respectively, plot L,-, the roots, simple 
roots, Weyl vector and the set of dominant weights your picture. 
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The final thing we need from the representation theory is the Weyl character formula. 
Strangely enough, we need it for Lie group rather than Lie algebra representations. 
This is because the form of quantum groups (more precisely, quantized enveloping 
algebras) that we are using is 'partially integrated' and analogs of elements in the 
Cartan subalgebra of sl^C belong to SU(/V) rather than sforC Of course, complex 
representation spaces of SU(/V) and sl^C can always be identified since sl^C is the 
complexified Lie algebra of SU(/V) (see Fulton and Harris [62]). The difference is only 
in the operators whose traces are taken for characters. 

We sketch a proof of the Weyl character formula here, but want to mention that there is 
a different proof presented in the book by Simon [144]. 

Let A : G — > Aut(C iV ) be a group representation then its character is defined to be 



A function / : G — > C is called a class function if it is constant on conjugacy classes, 
that is,/(g^;g _1 ) =f(x) for all x and g. Every character is a class function due to the 
cyclic property of traces. 

Exercise D.35 Verify the following basic facts about characters of unitary representa- 
tions: xvew = Xv + Xw, Xv* = X*v and Xv®w = XvXw- 

We will need to use inner products and orthogonality relations for characters. Let G be 
a compact group and dg be the Haar (normalized bi-invariant) measure on it. For two 
continuous complex- valued functions define the inner product 



Jg 

where "stands for complex conjugation. We reserve ( , ) for the inner product in U(/V). 
In the case of the symmetric group dg is just the normalized counting measure and 



The symmetric group ®n can be treated as a subgroup of U(/V) by identifying 
permutations with permutation matrices. Thus any function on U(/V) restricts to the 
symmetric group. 

Given two representations V and W one can define a third as Hom(V, W) with action 
(Af)(x) := A(f(A*x)). Let Hom(V, W) U W be the linear subspace fixed by the action of 
TJ(A0- 
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Exercise D.36 Check that the transformation of Hom(Y, W) U W defined by #(/") = 
J2ae& N a f/N* 1S the identity transformation. Notice that when V and W are irreducible 
representations the dimension of Hom(V, W)^^ is 1 if they are isomorphic and zero 
otherwise. By taking the trace of \I> conclude that (x\, X/*) = $\fi- 

Exercise D.37 By constructing a continuous analog of \£ from the previous problem 
prove that (xx,X/j,) = S X/X . 

Remark D.38 It is a fact that the characters form a complete orthonormal basis for 
the L 2 class functions on a compact Lie group. It follows that the two norms are in fact 
the same. 



We will now describe the irreducible characters of U(A0 and derive the Weyl character 
formula. Since every unitary matrix is conjugate to a diagonal one it suffices to define 
any class function just on the latter. Of course we have to make sure that if two diagonal 
matrices are conjugate to each other our function takes the same value on both. Two 
diagonal matrices are conjugate in U(A0 if and only if their diagonal entries differ by a 
permutation. Hence any class function is symmetric in the eigenvalues of matrices and 
conversely, any symmetric function can be extended by conjugation to a class function 
on the entire group. 

This means that we can describe characters as symmetric functions on the eigenvalues 
of matrices. It is also useful to consider alternating functions, that is, those that satisfy 
Lo(x a (i), ■ ■ ■ ,x ct (ao) = (— l) CT w(xi, . . . ,xm) for any permutation a. One such function 
of particular interest to us is the Vandermonde determinant <5o(x) := det(x^~') (the 
lower index is a coordinate and the upper index is a power). The point is that ratios (or 
products) of alternating functions are symmetric. 

Exercise D.39 Show that det^"') = i\i<j( x i ~ x j)- 

Now we are ready to introduce the symmetric functions that correspond to the characters 
of U(A0. Set 5 x (x) := detOcj^ - ') . Note that this is an alternating polynomial and 5q 
is the Vandermonde determinant. 



Definition D.40 The ratios 

s (x) - W _ det^j 
* XW - S (x)- detOcf-) 

are symmetric polynomials called the Schur polynomials. 
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These were named in honor of I Schur who discovered their connection to character 
theory in 1901. 

Exercise D.41 Prove that these fractions are indeed polynomials. 

Just as the characters of irreducible representations are orthonormal, the Schur polyno- 
mials are orthonormal: 

Sft) = Sx^ . 

In fact we will see that the Schur polynomials evaluated on the eigenvalues of a matrix 
are equal to the corresponding characters applied to the matrix. Our proof will use the 
above orthogonality of the Schur polynomials. A proof of this orthogonality independent 
of the character formula can be found in Fulton and Harris [62]. 

To prove that the Schur polynomials are the characters we will need to use the complete 
symmetric polynomials defined by 

H m (x) := ^x x . 

\\\=m 

Alternatively, they can be described by a generating function: 
Exercise D.42 Show that Uf=i( l ~ tx j)~ X = Hm=Q H m{x)t m . 

The following formula will prove very useful for manipulating Vandermonde-type 
determinants. 

Lemma D.43 Let a,- i = 1 , . . . ,N be a decreasing sequence of integers and set 
I a = { b | b\ > a\ > b% > ■ ■ ■ > b N > au > 0} . 

Then 

N 

deKxf) J](l " xjr 1 = E det ^ ■ 
7=1 bei a 

Proof The proof uses induction on 7Y. The case yV = 1 is standard. Using cofactor 
expansion along the first row gives 

=EL ( - i) " +1 ^ i(i -^ rldet(x P^ i ^n y ^ i -^~ 1 

= V detOth = Y det(xf )• 
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The last equality follows by grouping terms with b\ > /i>2 with terms with b^ > b\ 
when Z?2 > a\ ■ □ 

The next lemma employs the previous relation to prove the so-called Pieri formula. 
Lemma D.44 



S x (x)H m (x) = ^ S\+v(x) ■ 



\v\=m 
0<fi<Ai_i-Ai 



Proof We have 



oo IN 

Y,Sx{x)H m {x)t m = dtt{xf +N - l )\{{\ - txj)- 1 ) /det(xf-') 

m=0 \ y'=l 



rl A l [ detitx^+^Yld - txj)- 1 j /detftxf' 1 ) 

det(ttj ki+I ' i+JV - ! ') j /det(?xf-') 
\J2 det^+^+^O^I^ /det(xf-') 



= J2 S x+v Qc)tM . 

0<i/j<Aj_i-Ai 

Here | A| = ^ A,-, we used Lemma D.43 with a; = A, +/Y— 1 and set Vj = b[ — a,. The 
condition that b G I a is equivalent to v being a partition with < v\ < A,_ i — A,- . □ 

Remark D.45 The Pieri formula has a very nice graphical interpretation - the product 
S\H m is the sum of Schur polynomials with Young diagrams obtained from the Young 
diagram of A in all ways of adding m boxes to it with no two new boxes in the same 
column. We use this interpretation later for computations. 

We can now prove that Schur polynomials are the characters for the representations 
having a Young diagram with just one row. This will provide the base of induction for 
the general proof. 

Lemma D.46 Let the weight mu\ also denote the corresponding Young diagram with 
m boxes in one row, the corresponding partition and representation, then S miA}[ = H m = 

Xmuj t ■ 
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Proof The first equality comes from just applying the Pieri formula with A = 
0. To get the second equality note that a basis of V mu)1 is formed by the vectors 
CmuMT ® ' ' ' ® e N N ^ w i tn \ a \ = m - Here c muJl is the Young symmetrizer and gj form 
a basis in V = C N . Let g G \J(N) act on C N with eigenvalues Xj and eigenvectors ej ; 
then mLOi{g)c mwi {e^ (g> • • • ® e^) = i Q c fflQJ ,(e"' ® • • • It follows that 

Xmwife) = tr(mu;i(g)) = ^ x Q = # m (x) , 

| o I =m 

where x Q := x" 1 ...i*. □ 

The Weyl character formula for U(A0 is the simple equality 

Xx = S\. 

The general proof proceeds by induction on the number of boxes in the Young diagram. 
Lemma D.46 proves the equality for all diagrams with one row. For example 

To illustrate the proof for more rows consider the case of four boxes. Using the induction 
hypothesis and the Pieri formula (D.44) with A = 3n, m = 1 and A = 2u, m = 2 
respectively gives 

XauXn = S rrm + "gxi 

+ +s m . 

Next, using the Pieri formula (D.44) with A = 2a and m = 1 , multiplying the result by 
Xu = Hi, and then using the Pieri formula again gives the next formula. Repeating the 
process further gives the formula after that. 

XcnXaXn = S rrm + 2S^jj + + 
XaXnXaXn = ^ nm + 3>S^n + 2S^ + 3S|jp + Sn . 

Since a product of characters is the character of the tensor product, which in turn can 
be written as a sum of irreducible representations, it follows that all our character 
products can be written as sums of irreducible characters. For example we can write 
XrmXn = Y2a n ^X^ f° r some numbers n^. Since we need these numbers for different 
character products we have to double index them. It is convenient to use as the second 
index the diagram obtained by stacking the boxes in the product one under another. For 
instance, the coefficients of the above sum should be denoted n. - . 

Notice that the representation V ^n is a subspace of ® V u by definition and has 
to appear at least once in the irreducible decomposition. It follows that W n-n mn > 1 . 
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Writing the previous rive displayed equations in the matrix form will make the notation 
and the argument even clearer. 



1 
110 
1110 
12 110 
1 3 2 3 1 



S Mill 




E n rrm fiXn 






E w p^Xm 





















Inverting the matrix implies that we may write S\ 
Taking the norm gives 



E w a m Xm for some integers m AM . 



1 = (5 A ,5 A ) = ^ 



HI 



Working from the top down we see that m AA > 1 at one level implies that = 
and implies that m AA > 1 holds at the next level down. It follows that X\ = S\- There 
is a similar matrix for the Schur functions with A having any fixed arbitrary number of 
boxes. 

So far we only considered irreducible characters of U(A0 but now we can get the 
SU(A0 case for free. Recall that irreducible representations of SU(A0 are indexed by 
partitions of length < N. But we can treat such a partition as being of length < N by 
setting An = 0. This produces an irreducible representation of U(A0 that restricts to 
the original one of SU(A0. Since the characters are just traces the same formula gives 
them for SU(A0 as long as we treat A as an Af-vector with Xn = 0. 



Theorem D.47 (Weyl character formula for SU(A0) Let A denote any dominant 
weight (partition) of length < N and the corresponding irreducible representation of 
SU(N). Then 

Xx = S\ = — 

Exercise D.48 The coefficients defined by x\Xn = Ea^ n \^ u WQ tne c l ass i ca l 
Racah coefficients. Compute n^ WiUJi . 

There is a version of the Weyl character formula valid for any semisimple Lie algebra. 
Even for SU(A0 the formula is more useful to us in this general form. This also gives us 
a chance to introduce notation that comes in handy when considering quantum groups. 
Introduce a parameter q = e' z , z £ C, then the expression q 13 := exp(7z/3) is defined 
for all complex numbers and complex-valued matrices, for example by the power series. 
If (3 G £) for some matrix Lie algebra then q 13 is in the corresponding Lie group and on 
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weight vectors of a representation its action is given by g^v 7 = q^'&v^. The Weyl 
denominator is defined to be 

(92) 8 x {qh := ^-Ifq^+P^ . 

wew 



Theorem D.49 (Weyl character formula) The character xx of an irreducible repre- 
sentation of a semisimple Lie group is given by 

(93) xx = y-. 



Proof for SU(A0 It suffices to consider diagonal matrices (3 = Yli=i A'^ii w i m rea l 
entries and the trace Yli=i A' = 0. It is straightforward to check that f3j = (Li, (3) 
and qP = diag(xi , . . . , x^) G SU(A0 , where we put x\ := qP' . The Weyl denominator 
becomes 



=E„ 6 „(-i)'n;:,(« 



E£i(Ai+Pi)(<T(ii),/9> 



7(1) ■■■ A -a(N) 



=det(xf' +pi ) 

The last equality is the result of Exercise D.27. Our claim now follows directly from 
Theorem D.47. □ 



As an application let us compute xx(q 2p ), where as before p is the Weyl weight. This 
quantity turns up as a 'quantum dimension' in (17.35). First, for any f3 we have 

(94) 6 (q z P)= ^(-l)V w<p),2/3) = det(?^ w -0) 

w£0jv 

= II (q (P ' Li ~ Lj) - q-WM-LA = Yl (q^' a) - q~^ a) ). 
><j aeA+ 
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The third equality is the Vandermonde determinant identity from Exercise D.42. Now 
specifically for q 2p : 

(95) 5 x (q 2p ) = (-1)V V(A+P) ' 2p) = Yl (-1) V Y 2(A+P) ' W(P)) 

= (W (A+p) ) = n ^ (a+p,q) - <r (A+p,a) )- 

q£A+ 

Combining the last two formulas we prove the following. 

Corollary D.50 Let xx be the character of the irreducible representation of sljfC with 
the highest weight X and p be the Weyl weight. Then 

, Q ~ , 2p, W P ) yr ^ (A+P ' a) -^ (A+P ' Q) 



E Exact invariants from conformal field theory 

There is a simple idea that leads to the mathematical definition of conformal field theory. 
To understand this idea, consider the loop group of a Lie group. The loop group is the 
space of all maps from S 1 to the group. Now consider all formal power series with 
values in the associated Lie algebra. An element of this 'loop algebra' produces an 
element of the loop group by a two step process. First consider the formal variable to 
be an element of S l , so the formal power series is a map from S 1 to the Lie algebra. 
Second, exponentiate the answer to obtain a map from S l to the group. In this way 
problems in infinite dimensional geometry may be translated into problems in algebra. 

Let C((0) := {YT=-M a k tk W G C}, and C{tj := {J2T=o a k tk \ a k G C}; then the 
affine Lie algebra associated to g is defined to be 

9 := (g ® C((f))) CK, 

with bracket 

[A ®f + aK,B ® g + 0K] := [A,B] ®fg + (A,B)Res t=0 (f'g)K. 

The affine Lie algebra may be written as a direct sum of subalgebras, 

g = (g ® <C[/]) eg©ff©(g® r l C[r 1 ]). 

Any representation of g, say V, is a (0 8) fC[r]) © © C£ -module with (0 © rC[r]) 
acting as zero, acting as usual, and acting as Hd. The number k is called the 
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level. We will always assume that the level is a positive integer. If R C S and M is an 
R -module, then Ind^M := M (g># S is an 5-module. The Weyl module at level k with 
highest weight A is defined to be ^^f^tcM^gsCK^ and is denoted by V*. Restrict 
attention to s/^C for simplicity. An integrable module at level k is an 5 tjyC -module, so 
that K acts by multiplication by k and (En — Ejj) (g) t n acts locally nilpotently. It turns 
out that the category of level k integrable modules O™' is a strict modular category. 
To define the flip x^f, one uses a creative way to attach integrable modules to a 
Riemann surface at a number of points to obtain what is called the space of conformal 
blocks. This gives rise to the bundle of conformal blocks over the moduli space of 
marked Riemann surfaces of genus g . This bundle admits a projectively flat connection, 
and one can solve the parallel transport (a system of differential equations called the 
Knizhnik-Zamolodchikov equations) to see the effect of interchanging points marked 
with V and W to define the flip. This is not the obvious structure and the resulting 
invariants are far from the obvious ones. See Bakalov and Kirillov [21], Di Francesco, 
Mathieu and Senechal [45] and Kohno [87] for more information. 

Before moving on it is worthwhile to describe the simple objects in this category. The 
affine Weyl group at level k is the semidirect product W£ := W x kk y r acting on h* by 
(s, /3 V )(7) := 5(7) + /3 V , where W is the Weyl group of the associated Lie algebra. The 
interior of a fundamental domain for this action is given by 

(97) 7 = {AG h*|(A + p,a^) >0, (A + p,6> v ) <k + h y }. 

The simple objects in are in one to one correspondence with the weights in I. 
Given a weight A G I, 

4 := y\ ■= V k x /(U(B)(ve®r l ) k -^ +l v x , k . 

Here [/(•) denotes the universal enveloping algebra and v\,k is the highest weight 
vector of V x . 

Even though many physicists approach the Witten-Chern-Simons invariants through 
conformal field theory, we will approach these invariants via quantum groups since it is 
the fastest way to supply the definitions. It has been shown that the two approaches 
agree (see Tsuchiya, Ueno and Yamada [150] and Bakalov and Kirillov [21]). 
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